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Permanence of Interest Scores Over 22 Years 


Edward K. Strong, Jr. 
Stanford Univer :ity 


A study of permanence of scores from the 
Vocational Interest Blank for intervals of time 
up to ten years was published in 1943 (2, Ch. 
15). The data were based on test-retests of 
college seniors, freshmen, and 11th-grade high 
school students. 

To the above are now added test-retest corre- 
lations for as long a period as 19 years for college 
freshmen and 22 years for college seniors. 

In addition to answering the question: how 
permanent are interest scores, the data throw 
light upon additional questions. First, is inter- 
est similar to memory in that as time goes on 
the agreement between test and retest becomes 
less? Second, are interests progressively better 
established as one grows older so that the older 
one is when tested the better the agreement be- 
tween that test and a subsequent one? Third, 
which affects permanency more in the period 
from 17 to 44 years of age: (a) interval of time 
or (b) age at time of taking the test? 

The data are based on: (1) seniors at Stan- 
ford University first tested in 1927 and retested 
in 1932, 1937 and 1949, and (2) freshmen first 
tested in 1930 and retested in 1931, 1939 and 
1949. Six diffeent intervals of time can be 
studied as to their effect upon permanency of 
interest scores from the data respecting seniors 
and, similarly, respecting freshmen. Aside from 
the interval of time factor in the test-retests 
there is also the age factor; the seniors varied 
from 22 to 32 years of age and the freshmen 
from 19 to 28 years of age at the time of taking 
the test. The combinations of time interval 
and age are set forth in Table 1. 


Method of Analysis 


There are several different ways of measuring 
permanence of interest scores but the method 


89 


used here is that of correlation between the 
scores constituting the interest profile of the 
test with the scores of the interest profile of the 
retest. In other words the 34 occupational 
standard scores of a man in the test are corre- 
lated with the 34 corresponding scores of that 
man in the retest. Summaries of fifty or more 
such correlations are given in Table 1. Such 
correlations indicate how likely it is that the 
same general deduction from all scores will be 
valid in the future. 

Correlations for men ranging from 17 to 32 
years of age and for intervals of time from 3 
weeks to 22 years are given in Table 1. The 
data for the high school students are from Finch 
(1). He used the mean to summarize the cor- 
relations from many cases. Our summary is in 
terms of the median of the coefficients. (With 
our data medians and means are practically 
identical, differing on the average by only .005.) 
In most groups the median is based on the first 
fifty cases in our file. The samples for seniors 
are not identical since some men did not take 
the test in 1932 or 1937; the same is true for the 
samples of freshmen. 

Ir. columns 7 and 8 of Table 1 correlations 
are given in which the scores of 100 cases have 
all been plotted in one scatter diagram and one 
correlation calculated for all the cases. These 
coefficients average .035 lower than the medi- 
ans of individual coefficients. It is probable 
that the data for the few individuals with very 
low correlations have affected the second set of 
coefficients to a greater extent than the first set. 
In any case we believe the median correlation 
of many individual records is more enlightening 
than the single correlation based on the inter- 
mingling of one hundred records. 
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Table 1 


Permanence of Interest Scores 











Age at 


Time of 
Test 


No. of Educational 
Subjects Level 
(1) (2) 


Interval 
in Years 
between Number 
Test and of Correla- 
Retest Subjects tion 
(5) (7) (8) 





11th grade 

11th grade 

11th grade 
College freshmen 
College freshmen 
College freshmen 
College freshmen 
College freshmen 
College seniors 
College seniors 
College seniors 
College seniors 
College seniors 
College freshmen 
College seniors 


_ 


seeseSessesseeske 


.06 
1.25 
2.33 
1 








Results 


The correlation coefficients are amazingly 
high for test-retest cases: .84 for 5 years; .82 for 
10 years; and .75 for 22 years with college sen- 
iors. A fourth of all the correlations in the six 
sub-groups of seniors are .90 or higher, up to 
.994, the highest, and 60 per cent are .80 or 
higher; see Table 2. 

We concluded in 1943 that “approximately 
the same rank-order is maintained for occupa- 
tional interest scores for intervals ranging be- 
tween one and ten years.”” Now we can add 
“‘between one and twenty-two years.”’ In other 
words, those who had interests most similar to 


Table 2 


Percentage Distribution of Correlations 
Shown in Table 1 








Six Groups 


Six Groups 
of Seniors 


of Freshmen 





25.7 
34.4 
26.3 29.0 
13.3 22.0 

0.3 2.3 


19.0 
27.7 





engineer, lawyer, or minister on the first occa- 
sion were the ones who had scores most similar 
to those same criterion groups on the second 
occasion, and vice-versa. The data establish 
the fact that there is greater permanence in 
interest scores among students than has gen- 
erally been believed. 

We concluded in 1943 that “permanence of 
interest scores is somewhat less than for intelli- 
gence test scores but more permanent over the 
college period than college grades, and distinctly 
higher than for attitude test scores.”” We now 
wonder whether or not intelligence test scores 
will prove to be any more permanent than in- 
terest scores for periods of ten to twenty years. 

The average reliability of the interest scales 
upon which the scores are based is about .89. 
On this basis the correlation between test and 
retest scores on a single scale should not rise 
above :89. The correlations considered here 
are not, however, correlations of scores on a 
single scale, but correlations between scores on 
34 different scales on the test and on the subse- 
quent retest. What the reliability is of 34 
scores in a profile has not been calculated. It 
seemingly must be higher than .89 or we would 





Permanence of Interest Scores Over 22 Years 91 


not have five correlations in Table 1 between 
.86 and .88 based on cases ranging from 17-year- 
old boys with an interval of three months to 32- 
year-old men with an interval of twelve years. 

Permanence of interest scores is affected by 
the two factors of age at the time of taking the 
test and interval of time between test and re- 
test. From the data in Table 1 it appears that 
the two factors are of equal importance, for the 
correlation between permanence and age with 
time interval partialed out is .818 and between 
permanence and time interval with age par- 
tialed out is .812. 

To the extent that permanence is affected by 
interval of time we may characterize interests, 
first, as similar to memories which fade with 
time, second, as phenomena which progres- 
sively change with increasing maturity, and 
third, as phenomena which shift with experi- 
ences peculiar to the individual. We need to 
know how much all changes in interests are 
caused by maturation, how much they are 
caused by individual experiences and how 
much by forgetting. 

According to our calculations, permanence of 
interests is affected as much by the age at 
which the test is taken as by interval of time. 
It has previously been shown that maturation 
explains many changes in interests—that inter- 
ests change rapidly from those held at 15 years 
to those held at about 25 years, and then shift 
in the reverse direction much more slowly from 
about 25 years to 55 years of age. Of the 
change from 15 to 25 years, roughly speaking, 
one-third of the change in interests is between 
15.5 and 16.5 years, one-third between 16.5 and 
18.5 years, and one-third between 18.5 and 25 
years (2, Chs. 12; 13). 

As the changes in interests because of matu- 
ration are relatively small from 22 to 32 years 
of age we should expect high permanence of 
interests from men within this age range. This 
is what is shown in the table, for all the corre- 
lations are between .819 and .880 with one 
exception and that is for 22-year-old seniors 
retested twenty-two years later. On the other 
hand, the lowest correlations in the table, i.e., 
.667 to .725, are for younger men, 19 to 20 years 
old, retested 8 to 19 years later. It is unfor- 
tunate that there are no data of still younger 
men tested over long periods of time, and of 


older men tested for very short intervals of 
time. 

At least to the extent that permanence in- 
creases with age, interests are not to be associ- 
ated with memory for memories do not improve 
as one grows older. If anything, the reverse is 
the situation. We feel, at least, that it is 
harder and harder to remember faces and names 
as we become older. How far early memories 
are better established than older ones because 
of youth and how much because of repetition is 
unknown to the writer. 

Evidently, interests become better and better 
established with increasing age and consequent- 
ly the older the man when first tested the more 
likelihood that a test and retest will agree. 
But the data in the table indicate that although 
permanence is less for young men than older 
men, it is still remarkably high for young men. 
The correlation of .81 for 17-year-old boys over 
an interval of 2.3 years and of .72 for 19-year- 
old college freshmen over an interval of 19 
years are evidence of high permanency of inter- 
ests among young men. 

Interests are not separate entities but are an 
aspect of activities, which are habits. As we 
grow older certain activities are discarded but 
many more are engaged in continuously. A 
human being is more than a mere collection of 
activities, but the sum total of his activities 
constitutes a vital part of his' makeup. It is 
this part of a man we measure with an interest 
test. Indirectly an interest test must measure 
to some extent man’s underlying wants and 
desires, on the one hand, and his abilities, on 
the other hand, for the activities. must repre- 
sent what he wants to do modified by what he 
is capable of doing. 

In any attempt to test a man in order to 
discover what kind of person he is it is evident 
that an interest test should be included in the 
battery. 


Received June 17, 1950. 
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The purpose of the present study was to de- 
termine the magnitude of relationship which 
exists between job morale and American Med- 
ical Association binaural hearing loss. Sub- 
jects for the research were 91 per cent of the 
entire personnel of the (non-union) Chicago 
plant of the Automatic Spring Coiling Com- 
pany, a total of 53 personnel cooperating. While 
the sample is not large, the fact that the sample 
is almost a total one containing a random col- 
lection of levels of adult hearing efficiency is 
reassuring for this research. 

The personnel manager sent each worker a 
letter announcing the Illinois Institute of Tech- 
nology research program, stating that all coop- 
eration voluntarily given would be without ref- 
erence to any worker’s identity, and that the 
audiogram would be presented to each coop- 
erating worker. 

Hearing Loss. “Binaural hearing loss was 
computed on each employee according to the 
specifications of the American Medical Asso- 
ciation, after testing in a quiet private office by 
an Otarion-trained audiometrician with a Model 
600 Otarion Audiometer, a standard instrument 
measuring acuity at each of the conventional 
frequencies. Only one of the 53 workers wore 
a hearing aid; the remainder except one appar- 
ently regarded themselves as normal in hearing 
ability. In the entire group, hearing loss 
ranged from less than one per cent to 75 per 
cent. 

Job Morale. Employing the anonymous (but 
secretly sequence research-coded) Tear Ballot 

for Industry (1), ten job satisfaction variables 
were measured on a five-point scale and sum- 
mated: job security feeling, belief in employee 
welfare interest of company, attitude toward 

* The authors acknowledge with gratitude the coop- 


eration in this research of the Automatic Spring Coiling 
Company, Roy O. Carlson, and James Fairchild. 
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immediate supervisor, attitude toward working 
conditions, attitude toward co-workers, atti- 
tude toward pay, attitude toward communi- 
cation freedom, confidence in good sense 
of management, and opinion of effect of com- 
pany experience on personal happiness. These 
scores with a total possible range from 10 to 50 
actually ranged from 27 to 48. Age also is 
reported (in five-year multiples) on the Tear 
Ballot. 

In addition to this general job satisfaction 
score, two sub-scores were computed on each 
employee—interpersonal satisfaction score and 
“{mpersonal” satisfaction score. The inter- 
personal score consisted of a summation of 
responses to ‘How does your immediate supe- 
rior compare with other managers, foremen, or 
section leaders as to supervisory ability,” 
‘Are most of the workers around you the kind 


Table 1 


Tetrachoric Intercorrelations Among Certain Hearing 
Loss and Morale Values of 53 Employees 
in a Coil Spring Factory* 








Variables 2 S28 





. Per cent hearing loss 
(AMA) AS —.25 43 .23 
. Age 34 85 .04 
. Job satisfaction 31 68 
. Past mean job tenure 14 
. Interpersonal jrb 
satisfaction 
. “Impersonal” job 
satisfaction 
Partial correlations: 
ris. = —.42 
Tu2= .24 
rise = —.46 


fur = 12 
fu2 = 12 
'ua = 44 





* Coefficients statistically significant at the five per 
cent level or better are printed in boldface type. 
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who will remember you when you pass them on 
the street,” and “Do you feel that you have 
proper opportunity to present a problem, com- 
plaint, or suggestion to the management,” as 
such replies theoretically represent an em- 
ployee’s interpreted success, adaptation, and 
adjustment in interpersonal relations. The 
“{mpersonal” score consisted of a summation 
of the other seven items of the original ten. 
Turnover Rate. A separate information slip 
requested number of previous civilian employ- 
ers and also number of years work experience 
under these same employers. Thus “years” 
divided by “employers” allowed computation 
of a past mean tenure (reverse of turnover) 
value on each worker, as an independent esti- 
mate of the worker’s past adjustment. These 


mean past tenures, admittedly responsive to 
myriad factors other than adjustment, ranged 
from half a year to 24 years. 

Intercorrelations and some partial correla- 
tions were computed to exploit the obtained 
data, and these are presented in Table 1. 


Results 


As indicated in Table 1, the various criteria 
of worker morale are contaminated with the 
age factor. When the age factor is held con- 
stant by partial correlation, a statistically sig- 
nificant negative relationship survives between 
hearing loss and total job satisfaction. The 
magnitude of this correlation (—.42) suggests 
that about one-sixth of the variance in job 
satisfaction is accounted for by variance in 
hearing ability. When the total Tear Ballot 
score is broken into the interpersonal relations 
and the relatively “impersonal” sections, the 
interpersonal job satisfaction section fails to 
correlate significantly with hearing loss when 


age is held constant; this is an implied contra- 
diction of the clinical theory (2) that hearing 
loss is specifically a paranoid pitfall. The 
remaining more “impersonal” job satisfaction 
value correlates —.46 with hearing loss when 
age is held constant. 

Mean tenure in past jobs is unrelated to hear- 
ing loss when age is held constant. When both 
age and mean past tenure are held constant, 
total Tear Ballot job satisfaction still bears a 
relation of —.44 with AMA hearing loss. 


Summary 


Fifty-three (91 per cent of total personnel) 
employees of a small factory were measured for 
AMA binaural hearing loss, job satisfaction, 
mean tenure in past jobs, and age, in an attempt 
to measure some of the possible psychological 
effects of hearing loss on worker morale. 

1. When age is held constant by partial cor- 
relation, variance in hearing loss appears to 
account for one-sixth of the variance in job 
satisfaction (r = —.42). Hearing loss may be 
considered an important factor in job morale. 

2. Attitudes towards associates seem to be 
much less influenced by hearing loss (r = .24) 
than are the more “impersonal” aspects of job 
satisfaction (r = —.46). 

3. Mean tenure of past jobs is unrelated to 
hearing loss when age is held constant. 


Received. May 5, 1950. 
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Output Rates among Chocolate Dippers 


Harold F. Rothe 


Stevenson, Jordan & Harrison, Inc., Chicago, Illinois 


There is a growing recognition among psy- 
chologists of the difficulties of obtaining ade- 
quate criteria in industrial work. An increasing 
amount of space is being devoted to this topic 
in recent books and conferences. These diffi- 
culties are as true of objective performance 
data as of subjective criteria such as merit rat- 
ings. In a recent paper, Einar Hardin made 
particularly clear the difficulties involved in 
obtaining an adequate performance criterion.' 


Background of the Problem 


In a previous paper the writer presented data 
on the output of butter wrappers, showing that 
a large number of daily work curves (based on 
production during each fifteen-minute period) 
must be obtained before a stable or consistent 
curve for an operator could be obtained. Daily 
work curves for individuals had but little con- 
sistency from day to day (1). These butter 
wrappers showed an enormous range of produc- 
tivity from one fifteen-minute period to the 
next, so great that the variations within one 
operator’s performances over a five-day period, 
were greater than the average variation from 
one operator to another over the same period 
(2). 

In another paper on machine operators it 
was shown that the operators had relatively 
little consistency from one two-week period to 
the next, these correlations being .57 and .68 in 
that study (3). 

On the other hand, Tiffin has described a 
week to week consistency of .96 for hosiery 
loopers (4). It is probably noteworthy that 
the hosiery loopers were paid on an incentive 
system, and the butter wrappers and machine 
operators were not. It is also probable that 
other factors such as shortages of material, 
good or poor scheduling, amount of work ahead 
of each operator, validity of recorded data, and 
even more subtle factors affect the consistency 


1A paper. Pe a at the Sixth Annual Conference 


on Industrial Relations Research held at the University 
of Minnesota, May 9 and 10, 1950. 


of the operators from period to period. It is 
certainly clear that a great deal more detailed 
data must be obtained, together with a com- 
plete description of the setting in which the 
study took place, before industrial psycholo- 
gists can reach sound conclusions regarding 
output data. 

The purpose of the present paper is to pre- 
sent another study, and a description of the 
setting of that study, as another step in the 
development of an understanding of output 
rates. 


Background of the Study 


The data for this study were taken from the 
official books of the Fannie May Candy Com- 
pany of Chicago, Illinois.2 They cover the 
period December 16, 1949 through April 15, 
1950. They refer to regular, experienced female 
employees, working at their regular jobs at 
their usual workplaces. This job is described 
in the USES Dictionary of Occupational Titles, 
code number 6-05.312. 

The Fannie May Candy Company is charac- 
terized by friendly employee-management rela- 
tions. There has long been in effect an annual 
bonus system, a pension plan, a hospital plan, 
and similar aspects of progressive employee 
relations. There is an employee representative 
committee that represents the employees. 
Further, each employee feels free to walk into 
the General Manager’s ofhve at any time. 

There is an incentive system that has been in 
effect for many years, and the employees are 
paid a one-to-one ratio for performance over 
standard. It should be noted particularly in 
the data that follow here that this incentive 
system is an effective one, that is, the employees 
actually do regularly produce more than 100 
per cent of performance, and get paid accord- 
ingly. Finally, this job is completely con- 
trolled by the operator; there are no moving 

* The writer wishes to thank Mr. Harry H. Simpson, 


Vice-President of the Fannie May Cay Company, for 
permission to publish these data. 
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belts or machinery that govern the pace of the 
hand dippers. 

Since this study was limited to experienced 
hand dippers, and does not include persons on 
other jobs, part-time dippers, or relatively inex- 
perienced dippers or night-shift dippers, the 
sample is small, being only 18 girls. No data 
were included for any girl who did not work at 
least one full day on dipping in any one week, 
ond, hence, the size of the sample dropped occa- 
sionally to a smaller number, but on no occasion 
were there less than 15 girls included in the data 
for any one week. 


Table 1 


Weekly Average Output (Per Cent Performance of 
Standard) for Group of Chocolate Dippers 








Number 
of Girls 
Weekly 


Per Cent 


Week Ending Performance 





December 31 
January 


128.4 
127.5 
129.5 
128.8 
129.0 
126.9 
131.5 
127.5 
127.9 
126.7 
127.2 
127.2 
132.7 
134.2 
128.4 
126.0 


February 





Data 


Analysis of the trend of average weekly per- 
formance of the chocolate dippers shows that 
there was no consistent long-term upward or 
downward trend in productivity during this 
period. These data are shown in Table 1. 
Thus, it may be concluded that the period of 
time studied here was an essentially “normal” 
one. 

The correlation of each operator’s perform- 
ance for one week with her performance for the 
following week was determined by the rank dif- 
ference method. The distribution of the ob- 
tained Rho’s is shown in Table 2. The median 


TABLE 2 


Frequency Distribution of Rho’s between Successive 
Weeks’ Output Individual Performance for 
Group of Chocolate Dippers 





Rho Frequency 





.96-1.00 
91- 95 
.86- .90 
81- .85 
.76- .80 
75 
.66- .70 
65 
56- .60 
Median Rho = .85 
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inter-weekly correlation is .85. Thus each girl 
was quite consistent in her performance from 
week to week. 

The greatest and least amount of produc- 
tivity for each girl for any one of the 16 weekly 
periods is shown in Table 3, together with the 
ratio of best to worst performance for each girl. 


Table 3 


Highest and Lowest Average Weekly Performances, 
and their Ratios, for Individual Chocolate 
Dippers During 16-Week Period 








Highest 
Weekly 
Average 


Ratio of 
Highest 
to Lowest 


Lowest 
Weekly 
Average 





135 1.16 
123 1.29 
142 1.18 
175 1.22 
134 1.24 
114 1.10 
131 1.27 
149 1.16 
146 1.11 
136 1.09 
138 1.28 
150 1.10 
124 1.09 
145 1.24 
140 1.11 
147 1.43 
140 1.24 
133 1.11 
152 1.20 
Median intra-individual ratio = 1.18 
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Table 4 


Highest and Lowest Average Individual Weekly 
Performances, and their Ratios, for Group 
of Chocolate Dippers During 
16-Week Period 





Highest 
Girl’s 
Average 


Lowest 
Girl’s 
Average 


Ratio of 
Highest 


Week Ending to Lowest 





December 31 147 105 

January 149 101 

150° 102 1.47 

152 107 1.42 

150 1.52 

February 149 ¢ 1.51 

175 1.73 

144 1.47 

157 1.65 

1.58 

1.42 

1.47 

1.33 

1.53 

1.55 

15 1.44 
Median inter-individual ratio = 1.475 


1.40 
1.48 


March 





The median performance ratio of best to worst 
for these girls is 1.2. Thus, girl A produced a 
high of 135 per cent for one of these 16 weeks, 
and her lowest weekly average production dur- 
ing this time was 116 per cent, giving her a 
best/worst, or intra-individual ratio of 1.16. 

The ratio of best operator to worst operator, 
for each week, is shown in Table 4. During the 
week ending December 31, 1949, one girl pro- 
duced 147 per cent and another girl produced 
105 per cent, for an inter-individual ratio of 
1.40. 

From Tables 3 and 4 it can be seen that the 
average ratio of the range of inter-individual 
performance is greater than the average ratio 
of the range of intra-individual performance. 


Discussion 


The present writer has previously stated two 
tentative hypotheses relating to the effective- 
ness of incentives. One was that “the incen- 
tives to work may be considered ineffective 
when the ratio of the range of intra-individual 
differences is greater than the ratio of the range 
of inter-individual differences” (2, p. 326). It 


is quite apparent that here, where we may safely 
conclude that the various incentives were effec- 
tive, that this hypothesis is supported by these 
data. 

The second hypothesis was that “if the inter- 
correlation of group output rates for two periods 
closely related in time is less than .80 the incen- 
tivation is not highly effective, while intercor- 
relation higher than .90 indicates effective 
incentivation” (3, p. 488). This hypothesis is 
not supported by this study, although the error 
is in amount, not in direction. It is, of course, 
risky business making a hypothesis with as 
little data as exist in this field. On the other 
hand, hypotheses may stimulate someone else 
to do research (and to disprove them). 

On the strength of other data which have not 
been published, the writer believes that there 
is a relationship between consistency of data 
and effectiveness of incentives. He also be- 
lieves that the incentivation of the machine 
operators (previously described) was actually 
fairly high (3), even though there was no incen- 
tive system there at the time. That is, high 
incentivation probably existed there because of 
an intangible attitude and in spite of the lack 
of an engineered incentive system as such at 
the time of that study. 

Hence, the second hypothesis is herewith 
changed to read, “‘if the intercorrelation of out- 
put rates for two periods closely related in time 
is less than .50 the incentivation is not highly 
effective, while intercorrelation higher than .80 
indicates very effective incentivation.”’ 

This hypothesis is of course tentative. It is 
also likely that these coefficients will vary with 
the length of the periods of time correlated. 
Presumably lower correlations will be found if 
short periods of time are correlated than if long 
periods are correlated. 

Another point is also important here. It is 
apparent that the writer is attempting to dem- 
onstrate something other than “reliability” by 
proposing these hypotheses. That is, it ap- 
pears that the consistency of output, from time 
to time, may reflect a real phenomenon and 
permit some prediction (such as an estimate of 
the effectiveness of the incentives in a situa- 
tion) and not merely reflect the “reliability” or 
“unreliability” of the criterion. The concept 
of “reliability” has been taken over from the 
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realm of testing where it is related primarily to 
errors of measurement. Such errors undoubt- 
edly exist in work such as is described in this 
article. But it is also very likely that there is 
a “consistency” or “inconsistency” of indus- 
trial output data that is a real phenomenon and 
not merely an error of measurement.* 


Summary 


Analysis of the output rates of 18 chocolate 
dippers for a sixteen-week period, under condi- 
tions of excellent incentivation, reveals an aver- 
age week to week consistency coefficient of .85 
and an average ratio of the range of inter- 
individual performance greater than the average 
ratio of the range of intra-individual perform- 


3 This point has also been expounded by Einar Hardin 
previously cited. It is sincerely hoped that he will 
soon publish his paper on the subject. 


ance. Both these findings tend to support 
some hypotheses previously stated by the 
writer, although with changes in one of the 
hypotheses. 

The probability that coefficients of consis- 
tency of output rates, from time to time, reflect 
some real phenomenon or co-variate, and are 
not merely indicative of errors of measurement 
(“reliability”) is discussed. 


Received June 10, 1950. 
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The Prediction of Driving Skill 
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The most frequently utilized index for gaug- 
ing proficiency in the operation of motor vehicles 
is accident rate. As a consequence, in most 
instances the validity of tests for use in the 
selection of motor vehicle operators is deter- 
mined by relating scores to some measure of 
safety. Since the supposition is that to a con- 
siderable degree accidents are due to lack of 
driving skill, selection tests largely are of the 
apparatus type designed to measure basic motor 
and perceptual capacities. 

Two of the present authors recently consid- 
ered the problem of the usefulness of paper and 
pencil tests in the prediction of accidents of 
motor vehicle operators (1). Scores on a paper 
and pencil test battery, consisting of tests of 
dotting, tapping, linear distance discrimination, 
judgment of distance through perspective and 
interposition, and mechanical principles, were 
found to be significantly related to accidents 
among taxi cab drivers. This battery previ- 
ously was found to be effective in the prediction 
of the number of accidents incurred by street 
car motormen. Since this battery had proved 
to be effective it was argued that at least for 
selective purposes paper and pencil tests might 
be devised which would be as effective as the 
more expensive and cumbersome performance 
tests. 

One shortcoming of paper and pencil tests 
lies in the fact that while the scores they yield 
might be predictive of the degree of safety in 
performance, they are of far less value than 
performance tests for purposes of diagnosing 
particular deficiencies which might well lead to 
accidents. Knowledge of such deficiencies 
would be most helpful in remedial training. 
The possibility remains, however, that paper 
and pencil tests might be useful in indicating 


which individuals are deficient in driving skill. 
By retraining in basic driving skills operating 
safety presumably would be improved. It was 
the purpose of the present investigation to deter- 
mine the validity of the aforementioned paper 
and pencil tests in the prediction of driving 
skill. If it appears that the validity is signi- 
ficant then such tests might be utilized as part 
of a diagnostic battery. 


Methods and Procedure 


An opportunity arose in a large-scale driver 
testing program for comparing the paper and 
pencil tests with two of the most frequently 
used driver performance tests in terms of their 
relationships with work-sample tests involving 
the actual operation of motor vehicles. Two of 
the authors (Bartelme and Fletcher) conducted 
a drivers clinic at one of the larger army bases 
in which the driving skills of over two hundred 
operators were evaluated. In addition to being 
examined on the paper and pencil tests the 
operators were examined on the steering and 
brake reaction time test in use by the California 
Department of Motor Vehicles in their drivers 
clinic. 

After completing these tests the drivers were 
given a modified version of the exercises for 
driving skill developed under the sponsorship 
of the American Automobile Association (2). 
There were four of these work-sample exercises 
as follows: 

1. Driving forward and backward on a 
straight line for a distance of 100 feet. 

2. Driving forward and backward in a nar- 
row lane affording only about two feet clear- 
ance on each side. 

3. Aligning the front and back wheels and 
the front and back extremities of the car with 
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Table 1 
Coefficients of Correlations between Various Tests and Total Score on the Work-Sample Driving Exercises 





rators of 
ium Vehicles 


Operators of All 
Light Vehicles M Operators 


Type of Test a N r - ** 


Operators of 
Heavy Vehicles 








Dotting .03 08 
Tapping 10 13 
Distance Discrimination 46 01 
Mechanical Principles Al 24 
Judgment of Distance .08 —.01 
Paper and Pencil Test Battery .24 37 ll 
Steering .23 72 20 
Reaction Time —.11 42 


—.16 — .02 
—.05 06 

-16 .21 
— .03 .08 
—.05 —.01 


—.12 05 
‘ 18 .20 
ll 09 02 





* These coefficients are averages from two or more vehicle groups, using the Fisher £ method. 


crossmarks on the roadway, both forward and 
backward movement of the car being used. 

4. Brake reaction time (movement of foot 
from accelerator. to brake), and stopping dis- 
tance using the detonator test. 

Each driver was allowed to perform the fore- 
going exercises with the type of car he custom- 
arily drove in his employment. The perform- 
ance on each exercise was scored in terms of 
judgments of both qualitative and quantita- 
tive characteristics. The total score consisted 
of a simple sum of the point scores of the four 
exercises. This score was used in evaluating 
the several tests. 


Results 


Table 1 presents the coefficients of’ corre- 
lation between the total scores on the work- 
sample driving exercises and scores on the vari- 
ous tests. Under light vehicles are included 
passenger cars and # ton trucks. Medium 
vehicles were 1} ton trucks. Heavy vehicles 
included 2} ton trucks and buses. When two 
or more types of vehicles were combined, the 
resulting coefficient was obtained by averaging 
the coefficients for the several types through 
use of the Fisher ¢ method. 

It will be noted that the individual paper and 
pencil tests show little promise of predicting 
actual performance. The Distance Discrim- 
ination Test gave the highest coefficients. This 
result can be rationalized in terms of the visual 
characteristics involved in the particular exer- 
cises used in the work-sample tests. The Dis- 


tance Discrimination Test would seem to merit 
further study in driver testing situations. 

Although the evidence can be considered 
merely suggestive, it would appear that the 
relationships between the paper and pencil tests 
and the work-sample tests is not the same for 
heavy vehicles as for light vehicles. In the 
case of heavy vehicles, the greater number of 
negative coefficients, even though they are of 
small size, suggests the possibility of different 
types of functions underlying performance on 
light and heavy motor vehicles. 

Of the two performance tests the steering 
reaction test is superior to the brake reaction 
time test. The steering test gave consistently 
positive results for the various kinds of vehicles. 

The paper and pencil tests were combined in 
a battery, the weights being those determined 
in ptevious investigations with vehicle oper- 
ators (1). For light cars the paper and pencil 
test battery shows as high a relationship with 
the work-sample performance as does the steer- 
ing test, but is inferior to this test in terms of 
relationship with performance on the heavier 
vehicles. It would appear that insofar as the 
tests used in the present study are concerned, 
the advantage lies with the performance tests, 
and particularly with the steering test. 


Conclusions 


It appears from the findings of the present 
investigation that the prediction of driving skill 
by means of tests is by no means wholly satis- 
factory. This conclusion is consistent with the 
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results of other investigations (3). It can be 


concluded that insofar as driving ability was 
measured at all by the tests used, the steering 
test consistently gave the best results for all 
types of automotive drivers, and that for oper- 
ators of light vehicles a battery of paper and 
pencil tests was equally good. 


Summary 


Achievement scores of operators of motor 
vehicles on five paper and pencil and two per- 
formance tests were compared with their scores 
on work-sample driving tests involving the 
actual operation of motor vehicles. For the 
most part the coefficients of correlation were 
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low. A steering performance test gave the 
most consistent results. With the exception of 
drivers of light vehicles, paper and pencil tests 
were found to give inadequate predictions of 
driving skill. 


Received May 29, 1950. 
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Variability of Reaction Time and Susceptibility to 
Automobile Accidents * 
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Reactions to visual and auditory stimuli are 
made more or less continuously by the automo- 
bile driver, particulariy on a heavily-traveled 
street or highway. Individual differences exist 
in the speed with which drivers are able to 
respond to these stimuli. Measurements of 
reaction time have been traditionally included 
in experimental test batteries designed to pre- 
dict accident susceptibility of drivers, on the 
premise that the number of accidents or the 
accident-rate of an individual driver is partially 
a function of his reaction time. 

However, the reported correlations between 
reaction time and accident rate have been dis- 
appointingly low. One explanation which has 
been advanced for these low correlations is that 
the “slow reactor’ soon becomes aware of his 
“handicap” and makes adjustments or com- 
pensations accordingly in his driving habits. 
Another explanation is that it is variability of 
reaction time which is the reaction-time factor 
related to accident-rate rather than reaction 
time per se. In other words, it may be that 
the driver who is consistent in the speed with 
which he reacts to a stimulus, regardless of how 
high or how low his mean reaction time may be, 
is least susceptible to accidents. 

The purpose of this study was to test the 
hypothesis that an individual whose reaction 
times are relatively consistent throughout a 
short series of measurements, regardless of the 
magnitude of his mean reaction time, will be 
found to be less susceptible to automobile acci- 
dents than is the individual whose reaction 
times are more variable. 

A review of the published studies since 1930 
has shown them to be inconclusive on one or 
more points. 

Johnson (3), in his review of the literature 
pertaining to the detection and treatment of 


* The writers gratefully acknowledge the cooperation 
of Capt. N. A. McQuown and Sgt. F. L. Crewe of the 
Traffic Education Unit, Los Angeles Police Department, 
in conducting this study. 
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accident-prone drivers, refers to an unpublished 
study by Cobb (1) in which a correlation coeffi- 
cient of .06 + .03 was obtained between vari- 
ability of reaction time and propensity to head- 
against-rear accidents or passing collisions over 
other types of accidents. In Cobb’s study, 
measurements were made of simple reaction to 
an auditory stimulus using a sample of 3,663 
drivers licensed in Connecticut. Thirty-six 
measurements were made per subject. The 
criterion data were secured from the state rec- 
ords of accidents. Johnson states that “the 
records are not complete, except with respect 
to fatal accidents. It is probable that not 
more than half the accidents which are report- 
able under the law, but which do not involve 
deaths or personal injury are actually reported, 
unless the drivers are insured.” Johnson also 
criticizes Cobb’s sample on the basis that “‘it is 
non-typical of the general driver population of 
the state, for it contained many drivers whom 
the authorities had sent in for examination 
because they had recently got into trouble.” 
As Johnson states, this no doubt tended to bias 
the distribution and limits the interpretation 
which may be made of the results obtained. 
The particular measure of variability which 
Cobb employed in his study is not given. 

In an independent investigation reported by 
Cobb (1), A. R. Lauer secured a correlation 
coefficient of —.12 between average variation 
in reaction time and accident-rate of commer- 
cial drivers. It is not stated whether the stimu- 
lus employed was visual or aduitory nor whether 
the reaction was simple, complex or choice. 
The number of reaction-time measurements 
recorded for each subject is not given. Lauer 
examined 1,014 commercial drivers in his inves- 
tigation. Accident records of the drivers were 
obtained from the drivers’ own statements and 
from company records. No public records 
wereemployed. Accident-rate was established 
by dividing total number of accidents by the 
longest driving period given by the driver. 
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Lauer attributes the low correlations obtained 
in his study to the weakness of accident records 
as a basis for the criteria and to the “multi- 
plicity of causes to which accidents may be 
ascribed.” 

Procedure 


Simple reaction-time measurements were 
made by means of a reactometer. The reac- 
tometer consists essentially of a chassis on 
which are mounted a seat, steering wheel, clutch 
pedal, brake pedal and accelerator pedal. A 
small, red stimulus light appears before the sub- 
ject on a screen which is an integral part of the 
apparatus. The screen is approximately five 
feet from the subject’s eyes as he is seated 
behind the steering wheel. The accelerator 
pedal and brake pedal are connected electrically 
toa timer. The timer records, in hundredths 
of a second, the time required for the subject to 
move his foot from the accelerator pedal to the 
brake pedal in response to the red stimulus 
light. The stimulus light appears at twelve 
different instances throughout a total testing 
period of 80 seconds. The time-intervals be- 


tween appearances of the stimulys light are not 
of equal duration. 


This minimizes the possi- 
bility of the subject anticipating the appear- 
ance of the stimulus light. The first 20 seconds 
of the testing period are for practice purposes. 
The stimulus light appears twice during this 
practice period, but_these two reaction times 
were not recorded in this study. The remain- 
ing ten reaction times were recorded and they 
represent the basic experimental test data used 
in the study. 

The reactometer test was administered to 
104 Radio Patrol Officers of the Patrol Bureau, 
Los Angeles Police Department, during the 
months of September, October and November, 
1948. This sample of 104 officers was used in 
computing the statistical reliability of the test 
data and in computing correlation coefficients 
between age and mean reaction time, between 
age and standard deviation of reaction time and 
between mean reaction time and standard devi- 
ation of reaction time. 

A second sample was selected from the orig- 
inal sample of 104 officers for the purpose of 
computing correlation coefficients between the 
criterion data and the three variables, age, 
mean reaction time and standard deviation of 
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reaction time. This second sample was select- 
ed on the basis of length of service with the 
Police Department. Since accident records 
were to be used as the basic criterion data, it 
was necessary that each officer in this second 
sample be a member of the Police Department 
throughout the selected criterion period. The 
second sample so selected consisted of 87 
officers. 

Data describing the age and the length of 
service of each of the two samples are summar- 
ized in Table 1. Means and standard devia- 
tions shown in Table 1 were computed from 
ungrouped data. 

The indices considered for each subject in 
this study were the mean of his ten reaction- 
time measurements and the standard deviation 
of these measurements. The reliabilities of 
these indices were computed by the split half 
technique from alternate raw scores. That of 
the means was thus found to be .87 which cor- 
rected for the number of measures yielded a 
reliability coefficient of .93. The reliability 
of the standard deviations was .27 which sim- 
ilarly corrected yielded a reliability coefficient 
of .41. 

The mean and standard deviation of the pop- 
ulation of mean reaction times of individual 
subjects were calculated for both the sample of 
104 officers and for the sample of 87 officers. 
Similarly, the mean and standard deviation of 
the population of the standard deviations of 
individual subjects were calculated for each of 
the two samples. In other words, these latter 
calculations yielded a “mean of the mean reac- 
tion times of the subjects” and a “mean of the 
standard deviations of reaction times of the 
subjects,” as well as a standard deviation in 
each of the two instances. These statistics are 
given in Table 2. All calculations were made 
from ungrouped data. 


Table 1 


Statistical Description of Samples 








Standard 
Deviation Range 
22-55 
24-55 


.5-24.0 
2.0-24.0 


Variable N Mean 


104 34.2 7.8 
87 35.9 7.4 





Age 


Yearsin Department 104 7.5 5.5 
87 8.7 $2 








Reaction Time and Susceptibility to Automobile Accidents 


Table 2 


Mean, Standard Deviation and Range of Mean 
Reaction Times and Standard Deviations 
of Reaction Time* 





Variable 
(Based on 10 
Reaction-Time 
Measurements) 


Standard 
N Mean Deviation Range 





Mean Reaction Time 104 43.1 43 
87 43.3 4.5 


Standard Deviation 104 3.93 1.53 
of Reaction Time 87 3.87 1.43 


34.8-56.3 
34.8-56.3 


1.56-9.83 
1.56-9.61 





* Data are shown in hundredths of a second. 


The range of the standard deviations shown 
in Table 2 (1.56-9.83) indicate that differences 
do exist in the time consistency with which in- 
dividuals react to a visual stimulus and that 
those individual differences are of considerable 
magnitude. 

The basic criterion data consisted of the offi- 
cial reports of accidents sustained by the offi- 
cers while on duty as drivers in police radio 
patrol cars. Police Department regulations 
require that each accident in which an officer 
is involved be reported regardless of how minor 
it may be. This gave reasonable assurance 
that the accident reports secured for each officer 
in the sample represented a complete record of 
his accidents. Accidents in which a patrol car 
is involved are investigated and reported upon 
by an accident investigation unit of the Police 
Department. 

The criterion period extended from January 
1, 1947 to January 31, 1949, inclusive. During 
this period the sample of 87 officers sustained a 
total of 34 accidents. Sixty-two officers, repre- 
senting 71 per cent of the sample, had no acci- 
dents during the criterion period. In other 
words, 25 officers, or 29 per cent of the sample, 
were responsible for the 34 accidents. One of 
these 25 officers had three accidents, each of 
seven had two, and the remainder had one 
each. A test outlined by Mintz and Blum (4) 
for examining significance of differences be- 
tween distributions of multiple accidents and 
the expected Poisson distribution was per- 

1 It is recognized that the over-all exposure to which 


drivers of police — cars are subjected is not com- 
parable to that of the general driver population. 
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formed. The results indicated that the ob- 
tained distribution did not differ significantly 
from chance and hence that no conclusion as to 
“accident proneness” of any of the subjects 
was justified. 

A subjective analysis was made of each acci- 
dent report and each accident was classified in 
one of the three different categories defined 
below: 

“A” accident—an accident in which the offi- 
cer was apparently at fault; an “avoidable” 
accident. 

“B” accident—an accident in which respon- 
sibility was doubtful or indeterminable. 

“C” accident—an accident in which the offi- 
cer was apparently not at fault; an “unavoid- 
able”’ accident. 

This analysis resulted in 13 accidents being 
classified as “‘A”’ accidents, 8 being classified as 
“B” accidents and 13 being classified as “C” 
accidents. Two of the officers who had more 
than one accident had one each “fA” and “C”’; 
2 had one each “A” and “B”’; 2 had one each 
“B” and “C”’; 1 had two “‘A’s”; and 1 had an 
da» a oon? and a is Sl 

Accident-rate was not computed for each 
subject since exposure to accidents, in terms of 
miles and hours driven, time of day on duty and 
general driving environment, was considered 
to be sufficiently homogeneous between the 
four groups of subjects in the sample. The 
four groups are comprised of the three accident 
groups and the no-accident group. The groups 
are mutually exclusive since the individuals 
having more than one accident were assigned 
to only one group with an “A” accident taking 
precedence over a “B” which in turn took pre- 
cedence over a “C.”’ 

Correlations and intercorrelations between 
the variables were calculated by the tetrachoric 
technique described by Garrett (2).2 In con- 
structing the tetrachoric tables the line of divi- 
sion was established at the mean for each of the 
three variables, age, mean reaction time and 
standard deviation of reaction time. Three 
different “splits” were made of the accident 


2 This correlation technique was considered appro- 
priate as an initial method of analysis in this particular 
study. If results obtained by application of the tetra- 
choric technique are not statistically significant, there 
is little, if any, justification for applying a more refined 
technique of analysis. 
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Table 3 
Basic Test Data 





Mean SD 
Reaction 
Time 


Mean 
Reaction 
Time 





A 40.9 

B 46.76 

Cc 44.41 
No Accident 43.3 


3.54 
3.48 
3.79 
3.97 





data. These three “splits” are defined as 
follows: 

1. “No” accidents vs. “A,” “B” or “C” ac- 
cidents. 

2. “B,” “C” or “No” accidents vs. “A” 
accidents. 

3. “C” or “No” accidents vs. “A” or “B” 
accidents. 

Further analyses, as indicated by the results 
obtained with the tetrachoric r technique, were 
made using the analysis of variance technique 
described by Snedecor (4). 


The SE of a tetrachoric coefficient of zero for 
N = 87 is .17. For N = 104 the SE is .15. 

Only ‘one of the nine validity coefficients 
shown in Table 4 is statistically significant. 
The correlation of —.30 between mean reac- 
tion time and accidents (“split” on the basis 
“B,” “C” or “No” accidents vs. “A” accidents) 
is significant at the five per cent level of confi- 
dence. This would indicate that the individ- 
uals with short reaction times are more likely 
to have accidents. However, the only other 
statistically significant coefficient in Table 4 is 
the one between mean reaction time and age 
(r = .44). It follows that the apparent mean 
reaction time versus accident experience cor- 
relation actually could be partly due to an age 
effect and not entirely to a reaction-time con- 
sideration. The correlation of —.18 + .16 be- 
tween age and the particular accident split, 
admittedly not significant statistically, is nev- 
ertheless indicative of such a possibility. The 
computed partial correlation of mean reaction 
time with accidents is reduced to — .25 with the 
age factor removed. This —.25 correlation is 


Table 4 


Tetrachoric r’s and SE’s : 








Variable 


, Accidents 





“Cc” or “No” 
vs. “Aa” or “BRB” 


“No” vs “A ” 
“” or “C” 


“ ” 
B,” “C” or 
“No” vs sa” 





SD of Reaction Time* 


Mean Reaction Time* 


Age 


—.07 + .16 
N= 87 


.08 + .16 
N= 87 


— .04 + .16 
N= 87 


09 + .16 
N= 87 


— 30+ .15 
N= 87 


— .04 + .16 
N= 87 


—.10 + .16 
N= 87 


—.18 + .16 
N= 87 


—.06+. 
N= 





* Based on 10 reaction-time measurements. 


Results 


The basic test data are shown in Table 3. 
The tetrachoric 7’s and the SE of each r are 
shown in Table 4.2 


*The SE’s shown in Table 4 were estimated by 
applying the formula: 
1.5(1 — r?) 


SEn Wo 


According to Garrett (2), “‘an approximation to the SE 
of a tetrachoric r may be found in the following way: 


not significant at the 5 per cent level of con- 
fidence. This establishes the need for careful 
consideration of the results before generalizing 
as to a mean reaction time-accident experi- 
ence relationship. 

It is also interesting to note the lack of statis- 
tically significant correlation between mean 
reaction time and standard deviation of reac- 


the a: is about 50% higher than the SE of an equiva- 
lent product-moment 7. . .” 
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Table 5 


Analysis of Variance in Mean Reaction Times of 
Accident Groups “B,” “C” and “No” 








Source of 
Variation 


Sum of 
Squares 


Degreesof Mean 
Freedom Square 





Between Groups 58.672 2 


= 233 “No” 


29.336 


Within Groups 
“3” os, gd “No” 


1501.258 72 20.851 





Total 1559.930 74 





29.336 
Feritica 1% = 4.92 


5% = 3.13 





tion time. This indicates that these two meas- 
ures are independent variables. This lack of 
correlation between mean reaction time and 
standard deviation of reaction time should be 
taken into consideration in comparing the 
validity coefficients obtained for mean reaction 
time with those obtained for standard devia- 
tion of reaction time. Apparently, the general 
similarity between these two sets of validity 
coefficients is not due to a high correlation be- 
tween mean reaction time and standard devia- 
tion of reaction time. Although mean reaction 
time apparently increases with age, as indicated 
by the correlation of .44 between these two 
variables, variability of reaction time does not, 
as indicated by the lack of statistically signifi- 
cant correlation between age and standard devi- 
ation of reaction time. 
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Inasmuch as a statistically significant tetra- 
choric correlation (5 per cent level of confi- 
dence) was found between mean reaction time 
and accidents “split” on the basis “B,” “C” or 
“No” accidents vs. “A” accidents, the accident 
groups “B,” “C” and “No” were subjected to a 
further analysis by applying an analysis of vari- 
ance technique. Results of this analysis are 
shown in Table 5. 

From Table 5 it can be seen that there is no 
statistically significant difference between the 
mean reaction times of accident groups “B,” 
“C” and “No.” This established the homo- 
geneity of these three accident groups with 
respect to mean reaction time and permitted 
treating them as one group in subsequent 
analyses. 

It was next desired to test the hypothesis 
that no difference existed between the mean 
reaction time of accident group “A” and the 
mean reaction time of accident groups “B,”’ 
“C” and “No” treated as a single group, with 
no hypothesis assumed as to population vari- 
ances. A modified Student’s /-test, described 
by Snedecor (5, p. 83), was used in which the 


t'is weighted according to the separately com- — ; 


puted variance of each of the two samples. 
Results of this analysis are shown in Table 6. 

From Table 6 it can be seen that there is a 
statistically significant difference (5 per cent 
level of confidence) between the mean reaction 
time of accident group “A” and that of accident 
groups “B,” “C” and “No,” when the latter 
are treated as a single group, with no hypo- 
thesis assumed as to population variances. 
This result is in agreement with the also statis- 


Table 6 


Significance of Difference between Mean Reaction Time of Accident Group “A” and Mean Reaction Time of 
Accident Groups “B,” “C” and “No” Treated as a Single Group 
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of 
Freedom 
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Mean 
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SD = ¥1.34857 = 1.161 
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140.88 
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1.34857 
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tically significant tetrachoric correlation of 
— .30 obtained between mean reaction time and 
the same accident groupings. 

Although the tetrachoric technique appears 
to be useful as a rough, preliminary method for 
looking at a given body of data from a number 
of different aspects, two additional points must 
be recognized before conclusions are drawn. 
First, this particular tetrachoric correlation 
was the only significant one of the nine which 
were calculated, and could have occurred by 
chance much as the so-called “nonsense corre- 
lation” (which is obtained when a body of 
data is examined from enough different aspects 
until a significant relationship is found) can 
exist due to chance. Second, the usual proce- 
dures for the evaluation of the statistical signi- 
ficance of a correlation coefficient are not 
applicable when the correlated data are not 
normally distributed; this is usually the case 
with accident records. The commonly used 
standard errors are then hard to interpret. On 
the other hand, the reaction times of policemen 
are probably approximately normally distrib- 
uted, and thus the /-test seems to be entirely 
appropriate. In this particular case, the ob- 
tained agreement between the tetrachoric and 
the /-test simply corroborates the significant 
relationship indicated by the tetrachoric. While 
the result can in no way be construed as being 
conclusive evidence for the existence of this 
relationship, it is suggestive of such a possibil- 
ity, and it would be of interest to test a new 
body of similar data for this relationship. 


Conclusions 


It can be concluded from the foregoing results 
that the stated hypothesis has not been estab- 
lished under the conditions of measurement. 
However, the fvllowing lines of investigation 
should be pursued further before the hypothesis 
_ is conclusively rejected: 

1. The single kind of measure of reaction 
time used in this study may not yield variabil- 
ity measurements which are representative of 
reaction-time situations in general. Basic re- 
search is needed to determine the number of 
different situations for which reaction-time 
variability must be measured in order to ade- 
quately represent general reaction-time vari- 
ability. ; 


W. L. Cation, G. E. Mount and R. Brenner 


2. It is possible that the subjects used are 
not representative of other groups of drivers 
for determining the relationship between reac- 
tion-time variability and accidents. Basic 
research is needed to establish representative 
classes of drivers within the total population of 
drivers. ts 

3. The analysis of variance establishes an 
apparent homogeneity of reaction times of a 
population made up of the “B,” “C” and “No” 
accident groups, and suggests the possibility of 
a difference between the mean reaction time of 
this population and that of the “A” accident 
category. However, the possibility that such 
a relationship is in reality a function of age, to 
which reaction time is apparently related, pre- 
cludes any generalization of this result. It 
also remains possible that the classification of 
accidents used is not correct for determining a 
relationship between reaction-time variability 
and accidents. Reaction-time variability may 
be related to a class of accidents not used in 
this study. Basic research is needed to estab- 
lish representative classes of accidents. 


Summary 


The purpose of this study was to test the 
hypothesis that an individual whose reaction 
times are relatively consistent throughout ~a 
series of measurements, irrespective of the mag- 
nitude of his mean reaction time, will be found 
to be less susceptible to automobile accidents 
than is the individual whose reaction times are 
more variable. 

A reactometer was employed in making sim- 
ple reaction-time measurements on a sample of 
104 Radio Patrol Officers. The reactometer 
measures the time required for the subject to 
move his foot from an “accelerator” pedal to a 
“brake” pedal in response to a small, red stim- 
ulus light. Ten successive measurements were 
recorded for each subject. 

A mean and a standard deviation were com- 
puted for each subject’s distribution of 10 reac- 
tion times. The mean reaction times ranged 
from .348 to .563 seconds with a mean of .433 
seconds. The standard deviations ranged from 
.0156 to .0983 seconds with a mean of .0393 
seconds. The statistical reliability of the 
means calculated by the split half technique 
was .87. Correction for the number of meas- 
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ures used yielded a reliability coefficient of .93. 
The uncorrected reliability of the standard 
deviations was .27 which similarly corrected 
yielded a reliability coefficient of .41. 

The basic criterion data consisted of the 
official reports of accidents sustained by the 
subjects while on duty as drivers in police 
patrol cars. These accidents were subjectively 
classified into three different categories as 
follows: 

“A” accident—an accident in which the offi- 
cer was apparently at fault; an “avoidable” 
accident. 

“B” accident—an accident in which respon- 
sibility was doubtful or indeterminable. 

“C” accident—an accident in which the offi- 
cer was apparently not at fault; an “unavoid- 
able” accident. 

An initial analysis of the data was made by 
means of a tetrachoric correlation technique. 

A statistically significant correlation (1 per 
cent level of confidence) was found between 
mean reaction time and age. 

A statistically significant negative correla- 
tion (5 per cent level of confidence) was found 
between mean reaction time and accidents 


when accidents were “split” on the basis “B,” 
“C” and “No” accidents vs. “A” accidents. 
The correlation, however, was nonsignificant 
at the 5 per cent level when age was partialed 
out. 


No statistically significant correlations were 
found between standard deviations of reaction 
time and (1) age, or (2) mean reaction time, or 
(3) accidents. 


107 


Using an analysis of variance technique, it 
was found that there was no statistically signi- 
ficant difference between the mean reaction 
time of accident groups “B,”’ “C” and “No.” 

Using a modified /-test, it was found that 
there was a statistically significant difference (5 
per cent level of confidence) between the mean 
reaction time of accident group “A” and the 
mean reaction time of accident groups “B,” 
“C” or “No” when the latter were treated as a 
single group. However, here the age effect 
was not taken into account. 

The results of this study do not substantiate 
the hypothesis that variability of reaction time 
is positively correlated with susceptibility to 
automobile accidents under the conditions of 
the present study. Certain basic research is 
needed to establish the generality of these 
results. 


Received May 19, 1950. 
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Psychological Climate and Accidents in an Automotive Plant * 


Vernon Keenan and Willard Kerr 
Illinois Institute of Technology 


and 


William Sherman 
International Harvester Company—Tractor Works 


In the belief that certain factors of psycho- 
logical climate and physical environment may 
be of great importance in the causation of acci- 
dents in heavy industry, the authors formulated 
a series of hypotheses concerning correlates of 
accidents and subjected each to an experimen- 
tal design for testing. The hypotheses were: 

1. Excessive variation in pressure to “get 
out production” induces stress and tension con- 
tributing to accidents. 

2. Intensity of normal production schedules 
contribute to accidents. 

3. High promotion probability encourages 
alertness which reduces accidents. 

4. A comfortable shop environment reduces 
accidents by minimizing physical annoyances 
and distractions. 

5. Emphasis on individual responsibility 
(rather than upon crew work) acts to reduce 
accidents. 

6. A reasonable level of job prestige is re- 
quired for safe behavior... ir 

7. Heavy manual work induces stress, fa- 
tigue, and weariness, which increase probabil- 
ity of accident. 

8. Incentive pay systems encourage alert- 
ness which in turn promotes safe behavior. 

9. Relative adequacy-predictability of work 
space is essential for avoidance of distraction- 
worry and safety. 

10. Presence of an obvious and threatening 
danger has a semi-hypnotic effect upon the 
worker and increases accidents which are unre- 
lated to the obvious danger. 


Subjects 


Lost-time accident records of 7,103 (average) 
personnel of a tractor factory for the 1944-48 


* The authors gratefully acknowledge the indispen- 
sable aid and original suggestions of Mr. Howard M. 
Huntington, Safety Supervisor, International Harvester 
Tractor Works. 


five-year period were tabulated for each of the 
44 shop departments of the plant, and accident 
rate per 100 personnel was computed for each 
department. The enterprise experienced 1,943 
lost-time accidents during this period, and 
departmental rates per hundred personnel per 
year ranged from 1.5 to 13.2. 


Procedure 


Five-point continuum ratings on each of the 
variables shown in Table 1 were collected inde- 
pendently from each member of a “best-in- 
formed” panel of judges. This panel of eight 
judges, all individuals with at least ten years 
of diversified experience in the plant, includes 
the staff assistant to the works manager, the 
assistant works manager, the general superin- 
tendent, the plant engineer, the chief of methods 
and rates, the planning engineer, the safety 
supervisor, and the industrial relations man- 
ager. Split-half (odd-versus-even judges) reli- 
ability coefficients corrected via the Spearman- 
Brown prophecy formula for these ratings are 
indicated in Table 1. 

Mean judge ratings on each department, on 
each of the ten variables, then were computed 
and the departmental means on each variable 
were correlated with departmental accident 
rates. Intercorrelations among all variables 
also are shown in Table 1. 


Results 


Results of these correlational studies between 
accident rates and certain psychological char- 
acteristics of organizational units are displayed 
in Table i. Asummary inspection of the table 
suggests that several of the ten hypotheses 
stated receive enough confirmation to continue 
them as tenable into more rigorous future re- 
searches. 
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Variation and Volume of Production Pressure. 
Neither of these variables seem to be signifi- 
cantly related to accident rate. The extra- 
ordinary fact of a negative (—.42) reliability 
cofficient suggests the chance identification of 
two “schools of thought” as to what consti- 
tutes “pressure to get things done.” Since 
each of these variables correlate substantially 
with at least one other variable in Table 1, it is 
obvious that the split-half reliability coefficients 
in these two instances are gross underestimates. 

Partial correlations of r,u.9 = —.18 and 
ri,1.4 = —.55 help estimate possible effects of 
incentive work and comfort of shop environ- 
ment on the relationship between accidents and 
rated variation in production pressure. When 
comfort of shop environment is held constant 
by partial correlation, the relationship in- 
creases to statistical significance, suggesting 
fewer accidents in departments with above aver- 
age variation in production pressure. While 
all statistics in this study which pertain to pro- 
duction pressure are somewhat questionable in 
interpretation because of the apparently di- 
vided opinions of raters, it is possible that vari- 
ation in production pressure tends to reduce 
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accidents by reducing monotony and encour- 
aging alertness. 

Similarly, when comfort of shop environ- 
ment is held constant, the partial correlation 
between constant production pressure and de- 
partmental accident rate attains (inverse) a 
value of unity. If this result is not artifactual, 
it may be that production pressure even when 
constant encourages alertness which in turn 
makes for safe behavior. 

Promotion Probability. Although the zero 
order correlation between departmental pro- 
motion probability and accident rate is insig- 
nificant, the relationship becomes significant 
(—.90) when comfort of shop environment is 
held constant by partial correlation. This 
result is consistent with the finding of a signi- 
ficant correlation of —.40 between these same 
two variables in a New Jersey electronics 
plant.! 

Comfortable Shop Environment. Most signi- 
ficant of the findings in this research is the dis- 
covery of a correlation of —.70 between accident 


‘Kerr, W. A. Accident proneness of factory depart- 
ments. J. appl. Psychol., 1950, 34, 167-170. 


Table 1 


Pearsonian Intercorrelations among Accident Rate and Each of Ten Other Variables in 
44 Departments of a Tractor Factory (7,103 Personnel) * 








Variable 1 2 3 


4 5 6 7 





. Variation in Pressure to 

Get Things Done . ‘ .00 
. Constant Pressure to Get 

Things Done P ; .23 

3. Promotion Probability 4 ‘ 92 

. Comfortable Shop Envi- 

ronment | 
. Degree of Crew Work ‘ 
. Job Prestige : j 94 
. Manual Effort Involved 
. Degree of Operational 

Congestion . . .00 
. Degree of Incentive Work ‘ 50 56 
. Degree of Obvious 

Danger Factor F 65 43 
. Accident Rate 11 —06 —.01 


—.67 


56 —30  .24 55 


A2 —.08 84 67 
— .28 94 .00 


44 —.59 
.00 58 46 
99 —30 —.34 
—.30 83 .68 


46 —.34 68 70 
— il 13 .05 36 49 


— 53 37 18 64 AT 43 
—.70 33 —.23 AT AS .28 





* Coefficients in boldface type are statistically significant at the five per cent level of confidence or better. 


In determining the statistical significance, the formula o, = 


i-r 
VN -1 


was used. Coefficients in italics represent 


odd-even-corrected reliability coefficients, except for accident rate which is not corrected. 
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rate and mean-rated comfort of the shop envir- 
onment. Partial correlations holding constant 
degree of operational congestion (— .60), man- 
ual effort involved (—.55), constant pressure 
(—.96), and degree of crew work (—.66) sustain 
the original correlation. The psychological 
implication of this result is that approximately 
half of the variance in departmental accident 
rates is accounted for by variance in the com- 
fort factor. Heavy industries in which forges 
and foundries are essential operations inevit- 
ably experience great difficulty in making some 
job environment physically comfortable. Heat 
radiating from molten metals, violent sound vi- 
brating from drop forges, and gritty earth par- 
ticles from foundry “molds” are vivid exam- 
ples of the practical obstacles to comfortable 
adjustment of the worker to the job. 

Workers with low “physical frustration tol- 
erance” under conditions of heat, noise, etc., 
probably have somewhat higher accident fre- 
quency than their co-workers. An accumula- 
tion of physical frustration tensions leads easily 
to “distractive’ behavior which often results 
simultaneously in reduced quality of work and 
accident. It should be emphasized, however— 


and this qualification is supported by the evi- 


dence of Mintz and Blum*—that variance in 
such accident proneness of indivudals probably 
accounts for a minority of the variance in acci- 
dent rates. In other words, the “normal” 
worker as well as the “prone” worker is prone 
to these same unsafe behaviors under identical 
frustrating conditions, but at a slightly better 
threshold of tolerance. A reasonable conclu- 
sion under such evidence appears to be that 
major preventative effort can most profitably be 
expended in working to alleviate as much as pos- 
sible the overall physical frustrations which im- 
pinge on all personnel. Such effort can reason- 
ably be expected to reduce the accident rates of 
both the more-prone and less-prone personnel. 

Degree of Crew Work. Departments in which 
the typical job performance is in a “crew” often 
have above-average accident rates. Crew work 
in difficult operations (e.g., moving a hot ingot 
to position in a drop forge and later removal of 
the shaped part calls not only for reasonably 
synchronous action and cooperation of the crew 

? Mintz, A., and Blum, M. L. A re-examination of 


the accident-proneness concept. J. appl. Psychol., 
1949, 33, 195-211. 


V. Keenan, W. Kerr, and W. Sherman 


members, but constant vigilance for the con- 
tinous accepting and relinquishing of respon- 
sibility as well as the development of any unus- 
ual circumstances. Less stringent cooperative 
work sometimes has greater accident probabil- 
ity also in large part perhaps because of the 
occasional transition moments in which respon- 
sibility for consequences is vague or suspended 
but operations nevertheless continue, from 
momentum if nothing else. In those brief 
seconds, accidents are made. In individual- 
type work the individual rarely is uncertain 
about responsibility for consequences. 

Job Prestige. It is interesting to note that 
while this correlation of —.23 with accidents is 
not statistically significant, it\is highly con- 
sistent with the value of —.30 found in the 53 
departments of the Camden Works of RCA’ 
Combining the two plants makes a total of 97 
departments with a total of 18,000 personnel, 
and the relationship between job prestige and 
accident rate is established as highly significant. 
Partial correlation evidence, however, suggests 
that the relationship is at least in part arti- 
factual. When comfort of shop environment 
is held constant, the coefficient declines to .12, 
and when manual effort is held constant it 
declines to —.16. High accident departments 
are characterized by less physical comfort and 
a demand for greater manual effort; inciden- 
tally, they are of lower average prestige value, 
but this fact seems to have little causal bearing 
upon accident rate. 

Manual Effort. The greater the degree of 
manual effort involved in successful perfor- 
mance of the typical job, the greater the ten- 
dency toward accidents (r = .47). This rela- 
tionship persists (r7,10.2 = .96) when “pressure 
to get things done” is held constant by partial 
correlation. As in the demand for constant 
vigilance in crew work, the demand for physical 
energy sometimes exceeds efficiently directed 
supply. Workers trained in economical expen- 
diture of effort as well as in control of physical 
forces may reduce the frequency of such acci- | 
dents. 

Operational Congestion. “Degree of opera- 
tional congestion” implies frequency-proximity 
of moving objects and persons. Such activities 
in the psychological periphery of the immediate 


* Kerr, W. A. Op. cit. 
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task may connote greater annoyance-distrac- 
tion value than actual physical threat to the 
worker, but may contribute (r = .45) to unsafe 
behavior of personnel. A limitation to this 
interpretation is the fact that the correlation 
declines to .06 when comfort of shop environ- 
ment is held constant by partial correlation. 
In view of this limitation, a causal interpre- 
tation of the zero order r is not warranted. 

Incentive Work. Departments with incen- 
tive work may on the average incur a few more 
accidents than do departments on the “straight- 
time” payment system. - This is suggested by 
the original correlation of .28 (not significant) 
which is increased to .35 when promotion prob- 
ability is held constant. While evidence is not 
available to test the point, it is possible that 
when more work is being performed (as it 
usually is under an incentive system) more acci- 
dents will occur. Thus, an increase in acci- 
dents under incentive conditions might be a 
function of increased productivity or productive 
activity (“motionless bodies” rarely have acci- 
dents). On the other hand, this finding barely 
borders on statistical significance and similar 
study of another factory yielded a correlation 
of .00 between incentive system and depart- 
mental accident rate.‘ 

Obvious Danger Factor. Departments with a 
constant obvious danger present such as glow- 
ing molten metal have significantly higher acci- 
dent rates (ry = .35) than do other departments. 
An arresting finding by Mr. Huntington, the 
Safety Supervisor, on the nature of the acci- 
dents in these departments is that the accidents 
usually are not physically involved with the 
obvious hazard factor. For example, in molten 
metal operation departments, accident rates 
are above average but they rarely include 
burns. The obvious hazard seems to exercise 
an almost hypnotic effect upon personnel, de- 
limiting their attention too much and facili- 
tating their involvement in other hazards. 
When comfort of shop environment is held 
constant (this is a questionable procedure in 
this instance because “comfort” and “danger 
factor” may be regarded as almost synonomous 
in certain key departments), the zero order r 


“Kerr, W. A. Op. cit. 
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declines to a partial r of —.04 between obvious 
danger factor and departmental accident rate. 


Summary 


Against a criterion of five years of lost-time 
accident rates in the 44 shop departments of a 
tractor factory, ten possible explanatory hy- 
potheses were evaluated by means of data from 
a special rating panel. Until more exacting 
and precise data are obtained, all conclusions 
of this research must be considered tentative; 
however, within these limitations, certain trends 
appear. 

1. Results obtained on variation in produc- 
tion pressure and constant production pressure 
are inconclusive. 

2. The hypothesis that promotion probabil- 
ity encourages safe behavior appears to be sup- 
ported by this study. 

3. Comfortable shop environment appears 
from this research to be a major determinant of 
safe behavior. 

4. Degree of crew work is positively corre- 
lated with accident rates of the factory depart- 
ments in this study. This is believed to be the 
result of more complete and continuous accep- 
tance of individual responsibility in the non- 
cooperative work situation. 

5. Job prestige of typical job fails to corre- 
late significantly with departmental accidents 
when certain other variables are held constant. 

6. The greater the degree of manual effort 
involved, the higher, on the average, is the 
departmental accident rate. 

7. Both degree of operational congestion and 
degree of incentive work fail to predict depart- 
mental accident rates when certain other vari- 
ables are held constant. 

8. The effects of an “obvious danger factor” 
on departmental accident rates are somewhat 
uncertain, although observational evidence in- 
dicates that where an obvious danger factor 
exists, the accidents which occur tend not to be 
identified with the obvious danger. Existence 
of an impressive obvious hazard seems to con- 
tribute to “unrelated” accidents by delimiting 
attention and encouraging proneness to involve- 
ment in the non-obvious hazards. 


Received June 5, 1950. 








Industrial Use of the Partington Pathways Test 


Lee W. Cozan 
Hechinger Co., Washington, D. C. 


No general claims of superiority regarding 
reliability and validity may be made in behalf 
of the short mental ability test in comparison 
with the longer test. However, Otis and Ches- 
ler (6) have pointed out several advantages in 
the utilization of the short mental ability test 
in the industrial situation. In the case where 
time is at a premium, the short test of mental 
ability may be used with more than acceptable 
validity. Where the selection standards are 
more exacting, the short test may be used as a 
screen to indicate those applicants who are 
definitely below or above the arbitrarily set 
mental level. This is a time-saving and eco- 


nomical procedure inasmuch as these applicants 
are not administered the entire battery of tests. 

A survey of eight paper-and-pencil tests of 
mental ability (1, 2, 4, 5, 9, 10, 11, and 12) 
suitable for employment purposes by business 
and industry, most of which are listed in the 
Third Mental Measurements Yearbook (3), 


revealed the “working time’’ ranged from 10 to 
50 minutes with the total administration time 
varying from 12 to 60 minutes. The means for 
“working time” and administration were 24 
and 30 minutes, respectively. See Table 1. 
To all appearances, there are a few short tests 
of mental ability suitable for use in the indus- 


Table 1 


Distribution of Time for Eight Mental Ability Tests 








Adminis- 
tration 


Working 
i Time 


Test Time 





Adaptability : 20 
Classification Test for Industrial 

and Office Personnel 12 
Benge Employment 30 
Bureau Test VI—Mental Alertness* 5 25 
Otis Self-Administering 35 
Scovill Classification 60 
Wesman Personnel Classification 35 
Wonderlic Personnel 20 





* This test is no longer published. 


trial situation. A short rest is arbitrarily de- 
fined as 15 minutes or less. Only three of the 
eight tests required 16 minutes or less. 

Prompted by this information, a research 
project was undertaken to determine the suit- 
ability of the Partington Pathways Test (7) as 
a short test of mental ability in the industrial 
situation. 


Description of the Test 


The Partington Pathways Test, initially 
called the Distributed Attention Test, was orig- 
inally devised in 1938 as a test of speed of 
performance. However, it appeared that suc- 
cessful performance on the test demanded the 
integration of previously learned material into 
a complex pattern of response requiring shifts 
and fluctuations in attention, accuracy and 
speed, organization, retention, recall, and syn- 
thesis. 

When the test was administered in 1942 to an 
unselected adult male population in order to 
develop a standardized procedure for adminis- 
tering the test and a method of scoring, the 
results led the authors to draw the conclusion 
that the test was a measurement of general 
mental ability because the scores on the test 
were in general agreement with the estimated 
intelligence of the individuals to whom the test 
was administered. The conclusion was sub- 
stantiated when the test was administered to a 
random sampling of a group of 256 male World 
War II veterans, ranging in age from 19 to 36, 
who had been given the 1937 Revision of the 
Stanford-Binet Scale of Intelligence. The co- 
efficient of correlation between the scores on 
the Pathways Test and the Revised Stanford- 
Binet Scale was found to be 0.68 (8). 

The Pathways Test has two forms. In Form 
I, the task requires the subject to draw a line 
connecting numbers from 1 to 25 in order 
beginning with number 1. The numbers are 
arranged in random order. Form II uses both 
numbers and letters and requires a line be 
drawn commencing with number 1 to letter A, 
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Table 2 


Average and Variability of Scores on the Otis 
S-A Test and Pathways Test 








Otis S-A Pathways 





Range 
Mean 
Standard Deviation 


20-66 
41.5 
11.7 


6-40 
21.6 
8.2 





from letter A to number 2, then from number 2 
to letter B, and so on. The time in seconds 
required to finish each pathway is the score. 
The time required is converted to a raw score 
by means of two tables (8). 


Results 


For the purpose of experimentally applying 
the Partington Pathways Tests in an industrial 
situation, a group of 80 clerical and skilled 
employees selected at random were adminis- 
tered the test. The group also had been given 
given the Otis Self-Administering Test of Men- 
tal Ability.' The correlation coefficient be- 
tween the scores made on the two tests was 
found to be 0.74. Table 2 summarizes the 
information about the range, mean, and 
standard deviation of the subjects on the Path- 
ways Test and Otis Test of Mental Ability. 

The average “working time” on the Path- 
ways Test was 135 seconds, ranging from 78 to 
255 seconds. Administration time ranged 
from 5 to 9 minutes, averaging approximately 
7 minutes. 

Summary 


A short test of mental ability has been pre- 
sented which appears very practical for use in 


1A ial printing of the test without the words 
‘‘mental ability” was used. 
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the industrial situation. Its chief advantages 
are: (1) a short “working time”; and (2) ease 
of administration and scoring. Its one draw- 
back is that being a speed test it is disadvan- 
tageous to use with older applicants since it has 
been found that older adults do not perform so 
well on tests depending chiefly upon speed of 
response. At the present writing, the test has 
been validated with another test, the Otis Self- 
Administering Test, with a resulting correla- 
tion coefficient of 0.74. 


Received June 29, 1950. 
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Mirror-Tracing Practice for 


a petit 


Medical and Dental Students 


Norma V. Scheidemann 
Los Angeles, California 


Dentists and nose-throat specialists conduct 
a considerable proportion of their professional 
practice in mirror-vision, with the mirror placed 
at every possible angle in relation to the arena 
of operation. General medical practitioners 
have a limited use for mirror-performance since 
they employ it with the indirect method for 
removing objects lodged in a mouth passage. 
Students of medicine and dentistry routinely 
develop mirror-performance skill incidentally, 
during laboratory and clinical practice, by more 
or less haphazard methods with attention fo- 
cused upon professional problems. 
It is suggested that these students develop 
mirror-performance skill through isolated sys- 
_ tematic mirror-tracing practice with attention 
focused upon orientation in mirror-spaces pre- 
sented in a mirror placed in a wide variety of 
positions, before clinical practice involving 
mirror-vision is undertaken. 
Several items are available’ for conducting 
such practice expeditiously, namely: 





F 

Fic. 1. A mirror-scope. A, Mirror-screen with 
pports; B, metal square with conduits; C, metal 
Square with supports; D, performance planes (supports 
of C inserted into conduits of B); Z, assembled mirror- 
scope (supports of A and C inserted into conduits 
of B); F, collapsed mirror-scope. B and C have com- 
pass markings for gauging direction of tracing on 
performance planes. 





1 Available from the C. H. Stoelting Company, 
Chicago, Illinois. 


A Mirror-Scope (Fig. 1). This device pro- 
vides for: (1) the reference mirror-spaces—for- 
ward, backward, leftward, rightward, upward, 
and downward—and all mirror-spaces at any 
angle to any of them; (2) mirror-vision in any 
mirror-space; (3) three-dimensional perform- 
ance planes for any mirror-space; and (4) com- 
pass markings on performance planes for gaug- 
ing direction of tracing at any angle, on any of 
the three-dimensional planes, in any mirror- 
space. 

The Compass Diagnostic Test of Directional 
Mirror-Tracing Skill (Fig. 2). This test pro- 
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Fic. 2. Compass test-pattern. 


vides, for diagnostic purposes, objective records 
of mirror-tracing skill in 24 radial directions 
on each of the three-dimensional planes, in 
each of the six axial mirror-spaces. 
Trace-and-Score Mirror-Tracing Practice-Pat- 
terns. These are three patterns of graded diffi- 
culty printed on a scoring scale. A Linear 
Practice-Patiern (Fig. 3) consists of 10 parallel 
lines that can be gauged to direct tracing at any 
angle on the three-dimensional planes, in any 
mirror-space. A 100 Angles Practice-Patiern 
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Mirror-Tracing Practice for Medical and Dental Students 


Fic. 3. Linear practice-pattern. 


(Fig. 4) consists of a continuous line featuring 
100 right angles. It is printed in two sizes. 
The large size directs practice in directional 
change in tracing axial directions in any mirror- 
space. The small size has shorter lines, which 
offer less opportunity for directional orienta- 
tion between directional changes. This size 
can be gauged for practice in directional change 
in tracing axial lines and also lines at any 


Fic. 4. 100 angles practice-pattern. 


specific angle on any of the three-dimensional 
planes in any mirror-space. 


Received January 5, 1951. 
Early publication. 
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A Study of Interview Refusals 


Sherwood Benson, Wesley P. Booman, and Kenneth E. Clark * 
University of Minnesota 


In the usual public opinion survey from 10 
to 20 per cent of the respondents refuse to be 
interviewed. Very little is known about these 
“refusals.” Do they have markedly different 
attitudes from those who cooperate? Or is it 
safe to assume that their opinions are basically 
similar to the opinions held by the more cooper- 
ative respondents who make up 80 to 90 per 
cent of the universe? 

Refusal rates have seldom been reported in 
the polling literature. Some researchers have 
regarded the reported refusal rate of 10 to 20 
per cent as alarmingly high (3). Refusal rates 
may be even higher—up to 30 per cent (5). 
Others express little concern (1, 6). They as- 
sume that refusals are not likely to differ in 
important respects from those who respond. 
However, neither group has published evidence 
supporting their conclusions. Certainly such 
evidence is essential before we can conclude 
that failure to complete refused interviews is 
causing a biased result. 

In this study we were concerned with two 
problems: (1) How can the refusal rate be cut 
down to insignificance? and (2) how do the 
refusals, if successfully completed, affect the 
survey results? Heneman and Paterson (3, 
p. 393) report that for many characteristics, 
persons who refuse to cooperate have decidedly 
different characteristics from those who do co- 
operate. However, the study which they cite 
(4) deals not with refusals but with respondents 
who were initially not at home. Furthermore 
it is not enough to know that those who refuse, 
or are not at home, have decidedly different 
characteristics from those who can be contact- 
ed. Unless such characteristics are related to 
the survey results, different characteristics per 
se, do not assure the pollster that the exclu- 
sion of the “refusal” or “not at home” is intro- 
ducing any bias. 


* The planning of the interviews of refusals, and the 
actual interviewing, was done by the first two of the 
three writers as part of their work towards the Master 
of Arts degree at the University of Minnesota. 


One of the few studies concerned with refusals 
has been made by Gaudet and Wilson (2). 
Their study is based on attempts to re-inter- 
view 1,800 panel members selected as represen- 
tative of Erie County. They found that refus- 
als tended to come most frequently from house- 
wives and people of low educational status. It 
should be noted, however, that their study dealt 
not with original refusals but with people who 
refused to be re-interviewed. 


Procedure 


The questionnaire used in this study was 
constructed by 60 members of a public opinion 
analysis class at the University of Minnesota 
and dealt with attitudes toward Negroes. It 
included twelve questions of the sort, “If you 
were in a hospital, would you mind having a 
Negro nurse?”’ A fixed address sample was 
selected by the instructor by listing every sep- 
arate dwelling unit in a class C socio-economic 
residential area in South East Minneapolis. 

Students enrolled in the public opinion class 
conducted the initial interviews. This group, 
which might be expected to have a better than 
average grasp of the problems of the public 
opinion survey, though they were somewhat 
lacking in experience, probably compares satis- 
factorily with the usual survey interviewers. 
With instructions to complete an interview 
with a ‘responsible adult” living in the unit, 
and to make up to three calls at a fixed address 
in the event the respondent was not at home, 
this group completed 173, or 75.2 per cent, of 
230 assigned interviews. Twenty-four, or 10.4 
per cent, were still not at home after three calls, 
and 33, or 14.4 per cent, refused to answer the 
questionnaire. 

The reasons for refusals as reported by the 
original interviewers were: Very busy, 11 or 33 
per cent; Sickness, 7 or 21 per cent; Not inter- 
ested, 2 or 6 per cent; and No reason, 13 or 39 
per cenit. 

The problem facing the writers then was, 
first, to contact all respondents reported not at 
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home, and second, to attempt to complete all 
the interviews which were initially refused. 
After making as many as five additional inter- 
views the 24 not-at-home respondents were 
successfully interviewed. Most of the inter- 
viewees (both refusals and not-at-homes) were 
ultimately contacted between 6:30 and 7:30 
p.m. The calls required to complete these 24 
and the original 173 are shown in Table 1. 

After most of the not-at-home respondents 
were contacted a method for completing the 
refused interviews was developed. Purely 
through armchair stratagem it was decided that 
the best results in this particular study might 
be obtained by using a young and pretty female 
interviewer.' One thing was clearly evident 
from the record of the refusals: most people 
refused before they even knew what the ques- 
tions were about. In training the interviewer 
strong emphasis was placed upon her appearance 
and a friendly manner. 

Numerous test interviews were conducted 
with the interviewer, the writers playing the 
role of reluctant respondents. A variety of 
different answers, based upon the anticipated 
objections of respondents, were prepared and 
rehearsed. Since practically all of the refusals 
occurred before any survey questions were asked, 


1 Approximately three out of four of the original 
interviewers were men. 
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such things as a friendly smile and an easy and 
confident greeting were stressed. It was also 
suggested that the interviewer: (1) keep talking 
as long as possible even though it appears cer- 
tain that the respondent will refuse; and (2) 
not step back or retreat from the doorway until 
the respondent actually closes the door. By 
accompanying the interviewer and observing 
her methods from a car the writers were able to 
offer suggestions and criticism as the survey 
progressed. 

On her first call our female interviewer com- 
pleted 29 of 33 attempted interviews with re- 
spondents who had initially refused to cooper- 
ate. In three of the four remaining cases, the 
writers, believing that less resistance might be 
encountered by another interviewer, attempted 
to re-interview the respondents. Before any 
re-interviewing, however, follow-up letters 
describing the survey and asking for coopera- 
tion were sent to the four remaining refusals. 
After the letters were sent the female inter- 
viewer successfully completed one and the 
writers two out of three of the re-interviews. 
When the remaining respondent had refused to 
cooperate it was decided to send him a second 
letter which included a questionnaire and a 
stamped, self-addressed envelope. No reply 
was received. 


Table 1 


Summary of Completed, Not-Home, and Refused Interviews Arranged by Call and Showing Time 
Required to Complete Each Set of Interviews 








Cum. % Time 
to Complete* 


Completed 


Yet-to- 


Not-Home Complete 


Refused 





Ist Call 136 50 
2nd Call 2 66 
3rd Call 75 
4th Call 5 82 
5th Call 85 
6th Call 86 
7th Call 87 
8th Call 88 
Additional Calls on 33 Refusals: 
1st Call 21 95 
2nd Call 8 98 
3rd Call 2 99 
4th Call 1 100 


70 94 
40 65 
28 

42 





* Per cent of total interviewing time required to complete 229 of the 230 interviews, assuming an interviewer 
using an automobile, and living 5} minutes from the interviewing area. 
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S. Benson, W. P. Booman, and K. E. Clark 


Results 


By carefully training and selecting interview- 
ers it was possible to complete 99.6 per cent of 
the interviews in a fixed address sample. 
(N = 230.) Since the refusals occurred before 
any questions were asked, appearance and 
manner of the interviewer seemed to be a major 
factor in reducing refusals. 

In order to complete interviews originally 
refused, about 14 per cent additional time was 
required. By trying both to contact the not- 
at-home respondents after three calls and to 
complete the refusals, about 33 per cent more 
time was required. This does not include the 
additional time necessary to treat these re- 
spondents on a more individual basis, which 
consisted of planning interviews, training the 
interviewer, and writing letters. 


Table 2 


Comparison of Refusals and Non-Refusals 
on Four Variables 





Attitude Years 
toward Educa- 
Negroes _ tion 


% 


Union 
Age Members 





Means: 
Total Group 
(N = 229) 
Non-Refusals 
(N = 197) 
Refusals 
(N = 32) 
Mean Differences, 
Refusals—Non- 
Refusals 
SEaisr 
t 


15.83 11.21 43.4 46.7 


16.22 11.43 46.7 


9.88 


13.41 


46.9 


2.81 
0.959 
2.93* 


1.55 
0.595 
2.61* 





* Significant at the one per cent level. 
t Non-significant, not computed. 


A comparison of the original refusals and the 
non-refusals was made on four variables: mean 
total score on attitudes toward Negroes,” years 
of education, age and union membership. These 
comparisons are summarized in Table 2. 


? Questionnaires were scored by assigning 2 value of 2 
to each favorable response, a J to an equivocal response, 
and a 0 to an unfavorable response. For the twelve 
aut scores ranged from 24 (perfect favorable score) 
to 0. 


Both in attitudes towards Negroes, and in 
years of eduaction, those respondents who ini- 
tially refused to be interviewed differ signifi- 
cantly from those who did not, the refusals 
being less favorable towards Negroes, and hav- 
ing less education. No appreciable difference 
was found between the two groups in either age 
or union membership. 

Since the refusals constituted only 14.4 per 
cent of the sample, their responses, when com- 
bined with those of non-refusals, did not affect 
either the total score on attitudes toward Ne- 
groes or the mean education level to any appre- 
ciable extent. This was true also in responses 
to individual items of the questionnaire, in that 
no percentage response would have deviated 
more than 3.4 points had the refusals been 
omitted from the study. (Even so, these dif- 
ferences are larger than the differences which 
would have obtained if the not-at-homes after 
three calls were withdrawn from the sample.) 
Had our refusal group been larger, as is prob- 
ably the case for many surveys, the net effect 
of the sort of difference observed here would 
have been magnified. 

Discussion 

In this study some indication has been given 
of the amount of time required to include in 
one’s sample the refusals or those not-at-homes 
requiring more than three calls to contact. 
About 14 per cent more time was required for 
interviewing respondents in order to include 
responses from those who initially refused to co- 
operate. About 33 per cent more time was 
required to complete interviews both with this 
refusal group and with those not at home for 
the first three calls. 

It is apparent that the decision on how much 
effort should be expended to complete all inter- 
views is a function of the particular type of 
question asked and the accuracy desired. If it 
is suspected that significantly different answers 
would be obtained from refusals as compared 
with non-refusals (as was found in this study), 
it would be wise to make a concentrated effort 
to complete enough refusal interviews to estab- 
lish trends. 

On the basis of the experience gained in this 
study, the writers would recommend: 
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1. That any agency seriously concerned with 
survey methods irain its interviewers so as to 
reduce initial refusals, 

2. That interviewers record on the schedules 
for refused interviews a short description of the 
refusing respondent, for later identification to 
attempt to complete the interview, and some 
indication of the reason for refusal, and 

3. That refusals be re-interviewed in any 
survey in which the number of refusals exceeds 
some minimum figure (say, 5 or 10 per cent), 
to determine whether this non-contacted group 
may influence to any appreciable degree the 
results of the survey. 


Received June 9, 1950. 
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Social Characteristics of Respondents and Non-Respondents in a 
Questionnaire Study of Later Maturity 


Ju-Shu Pan 
Chicago, Illinois 


The problem of selective response in. ques- 
tionnaire studies of adjustment of persons 65 
and over is important because of the increased 
chance of bias in favor of the better adjusted 
group. Non-response bias was analyzed in a 
study of personal and social adjustment of per- 
sons institutionalized in homes for the aged in 
the United States. 

Two hundred and sixty-three residents of 26 
homes sponsored by religious organizations 
filled out the schedule “Your Activities and 
Attitudes.”” These homes had a total of 2,566 
residents. Nine of the homes were sponsored 
by the Methodist Church and 17 were con- 
trolled by other Protestant groups. 

Schedules were distributed through super- 
intendents of the homes. Some refused to dis- 
tribute schedules in their institutions, often 
stating the residents were “too old.” Others 


Table 1 


Proportions of Schedule and Check List Respondents 
by Selected Social Characteristics * 








Check 
List 
Respond- 
ents 


Schedule 
Respond- 
ents 


Social 
Characteristics 


Critical 
Ratio 





Native-born 
Education 
None 1.3 7.2 
Grade 19.3 
College 24.4 11.2 
Professional 20.1 7.6 
Former Occupation 
Minister 
Clerical 
Housewife 
Health Rating 
Poor 
Excellent 
Physical Problems 
Four or more 3.8 8.8 


89.4 82.3 


15.3 4.0 
26.8 15.0 
19.8 37.0 


19.0 
18.3 


29.0 
10.3 


selected certain residents as respondents and in 
some cases asked for volunteer respondents. 
Superintendents of the same 26 homes were 
asked to prepare one page confidential “check 
lists” on every tenth resident, ranked alpha- 
betically. Check lists provided data on sex, 
age, education, occupation, health, and number 
of physical problems. Check lists for 335 (13 
per cent) of the 2,566 residents were received. 
Data for the randomly selected check list 
respondents were compared with the biased 
data for the 263 schedule respondents. The 
schedule sample was expected to be younger, 
better educated, and in better health than the 
check list sample. Non-respondents were ex- 


pected to be those residents suffering the great- 
est impairment of physical and psychological 
function. 

Mean age of the 263 respondents was 76.4 


years with standard deviation of 9.3 years. 
The check list sample was slightly older, with 
mean age of 79.5 years and standard deviation 
of 11.4 years. Critical ratio is 3.3. Signifi- 


Table 2 


Proportions of Check List Respondents in Cooperating 
and Non-Cooperating Homes by Selected 
Adjustment Characteristics * 








Non- 
Cooperating Critical 
Homes Ratio 


Cooper- 
ating 
Homes 


Adjustment 
Characteristics 





Nervousness 
Extremely nervous 0.9 
Very nervous 5.3 
Somewhat nervous 51.3 
Not nervous 42.5 

Happiness 
Very happy 
Average 

Adjustment 
Very well adjusted 


10.7 
19.6 
41.9 
27.8 


20.5 
32.8 


34.8 
20.8 


41.3 31.3 





* Number of cases on which percentages were com- 
puted ranges from 157 to 335. 


* Number of cases on which percentages were com- 
puted ranges from 232 to 279. 
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Social Characteristics of Respondents and Non-Respondents 


cant differences by nativity, education, occu- 
pation, health, and number of physical prob- 
lems are presented in Table 1. 

Differences between residents of homes with 
schedule respondents and homes without 
schedule respondents, as measured by super- 
intendents’ check lists, are presented in Table 2. 

Respondents filling out schedules and re- 
spondents for whom check lists were returned 
by superintendents differ significantly in the 
following ways: schedule respondents are older 
and more are native-born; more of the respond- 
ents had college or professional training, and 
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higher proportions were former ministers or 
clerical workers; respondents rate themselves 
healthier than the total group rated by super- 
intendents; fewer of the respondents have as 
many as four physical problems. 

Check list residents of homes with schedule 
respondents were rated as significantly less 
“nervous” than were check list residents of 
non-cooperating homes. A higher proportion 
were “very well adjusted’’ but significantly 
fewer were rated as “very happy.” 


Received June 6, 1950. 








' MMPI and Student Teaching 


John U. Michaelis and Fred T. Tyler 
University of California 


The diagnostic and predictive value of per- 
sonality tests are of basic importance in plan- 
ning for the use of such measures in a teacher 
education program. The reliabilities of vari- 
ous sub-tests and the intercorrelations between 
sub-tests are required in determining diagnostic 
value. The relationship of various sub-tests 
to teaching success gives an indication of pre- 
dictive value. With diagnostic and predictive 
value in mind, the usefulness of a particular test 
can be given effective preliminary appraisal. 

The purpose of this article is to make such 
an evaluation of the Minnesota Multiphasic 
Personality Inventory (2). The following nine 
scales are utilized in this study: kypochondriasis, 
depression, hysteria, psychopathic deviate, mas- 
culinity-femininity, paranoia, psychasthenia, 
schizophrenia and mania. 

The data in this article are based upon 56 
women students enrolled in student teaching. 
The mean chronological age was 23.5 years, 
with a standard deviation of 1.57. The range 
in age was from 21 to 41, 51 of the students 
being between 21 and 27 years of age. The 
test was given immediately before entrance 
into student teaching. Only those cases with 
low L (lie) and F (validity) scores are included 
in this study. 


Diagnostic Value of Sub-Tests 


The diagnostic value of sub-tests has been 
determined by the use of a technique first pro- 
posed by Kelley (3) and more recently reported 
by Bennett and Doppelt (1). The technique 
involves the calculation of reliability coeffi- 
cients and intercorrelations of the various sub- 
tests. From these data the diagnostic value of 


sub-tests can be determined by means of a 
nomograph developed by Bennett and Doppelt 
(1) 


The reliabilities of the sub-tests were deter- 
mined by means of the split-half method and 
stepped up by the Spearman-Brown prophecy 
formula. Table 1 gives the reliabilities for a 
random sample of 100 women students tested 
in the elementary educational psychology 
classes at the University of California. 

Reliabilities are very low on only two scales, 
namely, paranoia and mania. With the excep- 
tion of masculinity-femininity (.61), all of the 
others are over .70. 

In order to determine the diagnostic value of 
the sub-tests the odd-even reliabilities have 
been used along with the intercorrelations of 
the sub-tests which are reported in Table 1. 

The intercorrelations vary from —.09 to .83, 


Table 1 
Corrected Split-Half Reliabilities of Sub-Tests and Intercorr:lations between Sub-Tests* 





‘ D 





Hypochondriasis d 50 
Depression : 76 
Hysteria ‘ 16 
Psychopathic deviate : 36 
Masculinity-femininity é .26 
Paranoia ‘ 04 
Psychasthenia ‘ 67 
Schizophrenia : 54 
Mania —.01 


37 
AT 


SEERRESERI 





* For d.f. = 100, 7 must be aon ae .256 to be si 
55, r must be .265 and 345 to 


ificant at the .05 and .01 levels, respectively (4, 
coefficients in boldface type are ee 100 cases; the other coefficients are b on 56 
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t at the .05 and .01 levels, respectively. For d.f. = 
<P 212). The reliability 
cases. 
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Table 2 
Diagnostic Values of Pairs of Sub-Tests 





Sub-Test D Hy 


Pd 


Zz 
» 





Hypochondriasis : 33 
Depression 37 
Hysteria 

Psychopathic deviate 

Masculinity-femininity 

Paranoia 

Psychasthenia 

Schizophrenia 

Mania 


.22 
.22 
30 


nN 


eds «# # © # @ & 


| 





* Mean reliability correlations below .70 are not shown in the nomograph for evaluating pairs of tests. 
Asterisks indicate that proportion of difference in excess of chance cannot be determined from the nomograph. 


16 being significant at the .05 level and 13 at 
the .01 level. In general, the intercorrelations 
are low enough to suggest that the various sub- 
tests are measuring different features of per- 
sonality. It is of interest to note the inter- 
correlations between the tests constituting the 
neurotic and psychotic triads. The neurotic 
triad includes hypochondriasis, depression and 
hysteria, the intercorrelations being .50, .32 
and .16, suggesting that if the different tests 
are measuring neuroticism they are measuring 
different aspects of it, at least in this group of 
college women. In the main the same applies 
to the sub-tests constituting the psychotic triad 
(paranoia, psychasthenia and schizophrenia), 
the intercorrelations being .12, .06 and .83; the 
psychasthenia scores correlate .83 with the 
schizophrenia scores, this being the highest-cor- 
relation obtained in this sample. 

The proportions of the differences between 
sub-tests in excess of chance have been deter- 
mined by means of the nomograph designed by 
Bennett and Doppelt (1). This procedure uti- 
lizes intercorrelations between sub-tests and 
their reliabilities to determine the differential 
power of the sub-tests. This technique “esti- 
mates the proportion of all differences for that 
pair of tests which will represent true differences 
in ability” (1). The proportions are shown in 
Table 2. 

Segel (5) states that a value of .25 is the 
minimum required for useful differentiation. 
On this basis only seven pairs of sub-tests pro- 
vide useful diagnostic information, two of these 
pairs being members of the neurotic triad. If 


these values are verified in larger samples of 
comparable students it would appear that this 
test offers limited possibilities for diagnosis 
from the various sub-tests in the type of sample 
used in this study. 


! 
Relationship to Success in Student Teaching 


Success in student teaching was appraised by 
university supervisors who have been using the 
same rating form for several years. Correla- 
tions of the sub-tests with the supervisor’s rat- 
ings are shown in Table 3. 

None of the sub-tests correlates high enough 
with student teaching to offer possibilities of 
predicting success in student teaching with any 
reasonable degree of accuracy. 

In order to analyze further the differential 
power of the sub-tests, the sample of 56 students 
was divided into three groups on the basis of 


Table 3 


Correlations between Sub-Tests and Ratings in 
Student Teaching 








Correlations 


Sub-Tests with Rating 





Hypochondriasis 
Depression 
Hysteria 
Psychopathic deviate —.13 
Masculinity-femininity —.05 
Paranoia 14 
Psychasthenia 01 
Schizophrenia 09 
Mania — .06 


—.05 
— .08 
— 36 
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Table 4 


Significance of Mean Differences between 
High and Low Groups 





Mean of 
Low 
Group 


Mean of 
High 


Sub-Test Group 





3.82 
19.41 
19.88 
12.12 


3.86 04 
20.21 50 
23.21 2.57 
14.43 1.86 


Hypochondriasis 

Depression 

Hysteria 

Psychopathic deviate 

Masculinity- 
femininity 

Paranoia 

Psychasthenia 

Schizophrenia 

Mania 


37.65 
10.29 
9.12 
8.47 
14.12 


38.21 36 
8.86 1.85 
9.29 08 
7.29 65 

16.57 1.52 





* With 29 degrees of freedom ¢ must be 2.045 to be 
significant at the .05 level, and 2.756 at the .01 level. 


the supervisor’s ratings, high (N = 17), aver- 
age (N = 25),and low (N = 14). The/-ratios 
(4) between the means of the high and low 
groups are given in; Table 4. 

Only one of the sub-tests, ysleria, differen- 
tiates sharply between the most and least suc- 
cessful student teachers. The difference on 


this sub-test is significant at the .02 level. Psy- 


chopathic deviate and paranoia are significant 
at the .10 level and should be studied further in 
alargersample. - ~ 

Low differential power has also been reported 
by Stough (6) in a similar study carried out 
with male high school teachers. Stough, how- 
ever, did report differences between high and 
low groups on depression and schizophrenia 
which were significant at the .01 level, and on 
psychasthenia at the .05 level.‘ 


Summary 


1. The reliabilities of the sub-tests of the 
Minnesota Multiphasic Personality Inventory 
reported in this study aré fairly high with the 
exception of paranoia and mania. 

2. The intercorrelations between sub-tests 
are low, but only a few pairs of sub-tests have 
diagnostic value insofar as the present sample 
is concerned. 

3. In general, the sub-tests have very little 
value for predicting success in student teach- 
ing. However, the fact that there were dif- 
ferences between high and low groups on hys- 
teria, psychopathic deviate, and psychasthenia 
suggests that these be given additional study 
with other samples. Similar study should also 
be made of several new scales which are now 
available for this inventory. 


Received June 5, 1950. 
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Correction for “Women Students in Liberal Arts, Nursing, and 
Teacher Training Curricula and the MMPI” 


Orpha Maust Lough 
Rutherford, New Jersey 


In the Journal of Applied Psychology, 1947, 
31, 437-445, a study was reported in which the 
Minnesota Multiphasic Personality Inventory 
was given in a liberal arts college to 115 un- 
married women students, 61 of whom were 
cadet nurses and 54 liberal arts students, and 
in a state teachers college to 185 unmarried 
women students, 111 of whom were enrolled in 
the Music Curriculum and 74 in the General 
Curriculum for elementary school teachers. 
For Table 2 on p. 440 of the 1947 issue should 
be substituted Table 2, revised, presented on 
p. 126 of this issue. 

From the data given in Table 2, revised, on 
the criterion of a critical ratio of 2.6 or more 
for statistical significance, there are reliable 
differences between the cadet nurses and the 
students in the General Curriculum on the 
Interest (M;) scale, and between the cadet 


nurses and the Music students on the Hypo- 
chondriasis (H,), Depression (D), Interest (M;), 


and Schizophrenia (S,) scales. These data 
indicate that the students in the General Cur- 
riculum and the Music students are reliably 
more feminine in their interests than the cadet 
nurses; or, the latter are more masculine in 
their interests than the former. It also ap- 
pears that on the basis of the Hypochondriasis 
(H,) scale, the cadet nurses are significantly 
more psychologically mature, tending to re- 
spond to adult problems with more adequate 
insight and less disposed to seek sympathy or 


to develop undue anxiety over their health, 
than the Music students. The cadet nurses 
are also significantly more self-confident and 
extrovertive, less prone to bizarre and unusual 
thoughts or behavior, and have better morale 
than the Music students. These data confirm 
and substantiate the trends analyzed in the 
original report. 

In the Summary of the original report, sub- 
stitution should be made, as follows: 

2. There are statistically significant differ- 
ences (critical ratio of 2.6 or more) between the 
cadet nurses and the students in the General 
Curriculum on the Interest (My) scale, and 
also between the cadet nurses and the Music 
Students on the Hypochondriasis (H,), Depres- 
sion (D), Interest (Mr), and Schizophrenia (S,) 
scales. Note: In former paper, no statistically 
significant differences on the separate scales 
were reported. 

4. On the basis of this study . . . The Min- 
nesota Multiphasic Personality Inventory is 
not a useful instrument for differentiating be- 
tween those who are more suited for one occu- 
pation than another except, perhaps, for women 
desiring to prepare for the nursing profession. 
Even here, the differences are of such small 
magnitude, even though statistically signifi- 
cant, as to have little practical guidance or 
selection import. 


Received January 26, 1951. 
Early publication. 
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Ortho-Rater Color Vision Test Compared with the Ishihara 
and the Pseudo-Isochromatic Plates 


Newell C. Kephart and Melvin J. Tieszen 
Division of Applied Psychology, Purdue University 


Tests of color vision utilizing pseudo-iso- 
chromatic plates provide a simple and rapid 
means for detecting the presence of defective 
color vision. Two such tests, the Ishihara and 
the Pseudo-Isochromatic Plates are used exten- 
sively in industry as well as in other fields. An 
additional test utilizing pseudo-isochromatic 
plates which is used in industry is the color 
vision test in the Ortho-Rater.! 

Purpose of Investigation. While the Ishihara 
test and the Pseudo-Isochromatic Plates have 
become accepted as “valid” detectors of defec- 
tive color vision, the particular function tested 
by the Ortho-Rater color vision test is in ques- 
tion (1,6). The purpose of this investigation 
is to compare the performance on the color 
vision test in the Ortho-Rater with perform- 
ance on the Ishihara test and the Pseudo- 
Isochromatic Plates. 

The seventh edition of the Ishihara test was 
used throughout the present investigation. 
This edition consists of 32 plates, of which the 
24 intended for the testing of literate subjects 
were used. Hardy, Rand, and Rittler (3) have 
identified 18 plates from the Pseudo-Isochrom- 
atic series which they found to be a good screen- 
ing device for color defectives. 

The color vision test in the Ortho-Rater is 
the F-7 slide situated at the optical equivalent 
of eight meters. This slide is a Kodachrome 
transparency of four Ishihara plates (6). Two 
of the plates contain a single digit, while two 
are two-digit plates. The scoré is the number 
of separate digits correctly responded to, a 
“perfect” score being six. It is stated in the 
manual accompanying the Ortho-Rater (10, 
p. 6) that, “The present slide is temporary 
and classifies subjects finely only at the low 
levels of color discrimination ability.” No 
claims are made that the test differentiates 
types of color defectiveness. 

1 The Ortho-Rater is a visual testing device manu- 
factured by the Bausch and Lomb Optical Company 


for the visual classification and placement of industrial 
employees. 


Standard procedure was used in administer- 
ing the three sets of tests to two groups, one 
composed of 110 and the other of 52 male col- 
lege students. The purpose of giving the tests 
to a second group was twofold: (1) to obtain 
an estimate of the reliability of the original 
comparison; and (2) to cross-validate a pro- 
posed scoring key on the Ortho-Rater color 
vision test. 

The Ishihara test and the Pseudo-Isochro- 
matic Plates were used as criteria for the classi- 
fication of subjects as color normal and color 
defective. The performance scores of the two 
groups thus classified on the Ortho-Rater color 
vision test were compared. A revised scoring 
of the responses on the Ortho-Rater tests based 
solely on a statistical analysis of the responses 
made by the color normals and color defectives 
will be considered in the results. 


Results for Group I 


Performance Scores on Tests. Presented in 
Table 1? is a tabulation of the number of sub- 
jects responding correctly to various numbers 
of the 24 plates in the Ishihara test. It is 
shown that 14 subjects failed to respond cor- 
rectly to as many as 14 plates and so were 
classified as color defective. 

Shown in Table 2 are the number of subjects 
making correct resporises to the 18-plate selec- 
tion of the Pseudo-Isochromatic Plates. Fail- 
ure to respond correctly to five or more plates 
on this abbreviated test is considered indicative 
of defective color vision. Accordingly, 14 sub- 
jects are classified as color defectives. These 
14 subjects are the same subjects so classified 
on the Ishihara test. 

2 Tables 1, 2, 3, 4, 5, and 6 and, Figures 1, 2, 3, and 4 
present the detailed data. These tables have been 
deposited with the American Documentation Institute. 
Order Document 2917 from American Documentation 
Institute, 1719 N St., N.W., Washington 6, D. C., remit- 
ting $0.50 for microfilm (images 1 inch high on standard 


35 mm. motion picture film) or $0.50 for photocopies 
(6 X 8 in.) readable without optical aid. 
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The number of subjects earning performance 
scores ranging from 1 to 6 on the Ortho-Rater 
color vision test is shown in Table 3. It is 
apparent from the distribution that no dichot- 
omous grouping has resulted. If the critical 
score were set at the performance score of two, 
that is, if those who failed to respond correctly 
to more than two digits were classified as color 
defectives, 71 per cent, classified as color defec- 
tives by the Ishihara and Pseudo-Isochromatic 
Plates would be correctly classified; five per 
cent, classified as color normals by the other 
two tests, would be misclassified. Setting the 
critical score at the performance of three, 100 
per cent, classified as color defective by the 
criterion tests, would be correctly classified; 
ten per cent, classified as color normals, would 
be misclassified. 

Revision of Ortho-Rater Scoring Key. Exam- 
ination of responses to the digits on the Ortho- 
Rater test reveals that 100 per cent of the color 
normal group and 29 per cent of the color defec- 
tive group read the correct response of five on 
the first plate. On the second plate, 100 per 
cent of the color normal group and 93 per cent 
of the color defective group made the correct 
response of eight. Eighty-two per cent of the 


color normal group made the correct response 


of five to the first digit of the third plate, while 
only 15 per cent of the color defective group 
made this response. - Seventy-one per cent of 
the color defective group could read no digit on 
the third plate. Twelve per cent of the color 
normal group could read no digit on the third 
plate. Twelve per cent of the color normal 
group read the digit six instead of five, while 
none of the color defective group made ‘his 
response. Seventy-seven per cent of the color 
normal group made the correct response of six 
to the second digit on the third plate. Four- 
teen per cent of this group read the digit eight 
in place of six, while none of the color defective 
group made this response. On the fourth plate 
82 per cent of the color normal group made the 
response of seven to the second digit on the 
fourth plate, 27 per cent of this group read two 
for seven. Seventy-one per cent of the color 
defectives failed to see any digit on plate four, 
and none made the response of two for seven 
for the second digit. 
Since some of the “wrong” responses made 
- by those classified as having normal color vision 
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by the Ishihara test and the Pseuo-Isochro- 
matic Plates appear to be typical and are not 
common to the responses made by the color 
defective group, it is possible to devise a scoring 
key that will better differentiate between the 
color normals and color defectives. It is sug- 
gested that the scoring of plates one and two be 
continued as it isnow. In plate three it is sug- 
gested that six be an alternate answer for five 
and that eight be an alternate answer for six. 
In plate four it seems advisable that the two be 
scored as is now done, but that two also be an 
alternate answer for seven. 

The acceptance of three additional digits as 
correct or normal responses in plates three and 
four would produce the change in results graph- 
ically presented in Figures 1 and 2. The num- 
ber of color normals making a perfect perform- 
ance score of six increases from 32 to 61 per 
cent. Establishing a critical score of three 
would result in 100 per cent of the color defec- 
tives being correctly classified. Three per cent 
of the color normals would be misclassified as 
color defectives instead of the ten per cent mis- 
classified by the scoring key presently used. 


Results for Group II 


Performance Scores on Tests. In Table 4 it 
is shown that 11 of the 52 subjects composing 
the second group responded correctly to less 
than 60 per cent of the Ishihara plates and so 
are classified as color defectives. 

The same eleven subjects were placed in the 
color defective classification by the Pseudo- 
Isochromatic Plates. As for the original test 
group, there was complete agreement in the 
classification of subjects by the Ishihara test 
and the Pseudo-Isochromatic Plates. 

The tabulation of the number of subjects 
passing from one to six of the digits in the Ortho- 
Rater color vision test is given in Table 6. As 
in the previous group, no subject failed all six 
digits. It is again necessary to determine a 
critical score that will best separate the color 
defectives from the color normals. Setting the 
critical score at a performance score of two, 91 
per cent, classified as color defectives on the 
Ishihara test and the Pseudo-Isochromatic 
Plates would be correctly classified; 2 per cent, 
classified as having normal color vision by the 
two criterion tests, would be misclassified. Set- 
ting the critical score at a performance score of 
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three, 91 per cent, classified as color defectives 
on the Ishihara test and the Pseudo-Isochro- 
matic Plates, would be correctly classified; 17 
per cent, classified as color normals on the 
criterion tests would be misclassified. Whereas 
71 per cent of the color defective subjects in 
Group I had performance scores of two or less, 
91 per cent of the color defectives in Group II 
fall at these performance scores; the other nine 
per cent earn a score of four. Ten per cent of 
the color normals of the first group earned a 
performance score of three or less, whereas 17 
per cent of the color normals in the second 
group fell below a performance score of three. 

Revision of Ortho-Rater Scoring Key. Anan- 
alysis of responses to the digits on the Ortho- 
Rater test reveals that a like or greater per- 
centage of color normals in Group II gave the 
responses of six for five and eight for six in 
plate three and two for seven on plate four 
than did the color normals in Group I. None 
of the color defectives in Group II made these 
specific responses. The effect of the proposed 
revision of scoring on the distribution of the 
second group’s performance scores is graphi- 
cally presented in Figures 3 and 4. If a per- 


formance score of three is accepted as the crit- 


ical score, 17 per cent of the color normals and 
nine per cent of the color defectives would be 
misclassified by the present method of scoring. 
The revised key results in four per cent of the 
color normals and nine per cent of the color 
defectives in Group II being misclassified. 


Summary 


Two groups of male college students, one 
containing 110 subjects and a second contain- 
ing 52 subjects were administered the Ishihara 
test, the Pseudo-Isochromatic Plates, and the 
color vision test in the Ortho-Rater. 

The results of testing the group of 110 sub- 
jects showed perfect agreement in the classifi- 
cation of normal and defective color vision on 
the Ishihara test and the Pseudo-Isochromatic 
Plates when criterion scores developed by 
Hardy, Rand, and Rittler (4) were used. Es- 
tablishment of a critical score at a performance 
score of three (reading correctly three of the six 
digits) on the Ortho-Rater color vision test 
resulted in 100 per cent of the subjects with 
defective color vision and ten per cent of the 
subjects with normal color vision falling below 
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this critical score when the classification by the 
other two tests is accepted. A revised scoring 
key resulted in a reduction from ten per cent to 
three per cent in the number of subjects with 
normal color vision who were misclassified; the 
scores of subjects with defective color vision 
remained unchanged. 

The Ishihara test and the Pseudo-Isochro- 
matic Plates agreed in the classification of de- 
fective color vision and normal color vision on 
the second group. Ninety-one per cent of the 
subjects with defective color vision and 17 per 
cent of the subjects with normal color vision, 
as classified on the basis of the other two tests, 
fell below a critical score of three on the Ortho- 
Rater color vision test. The proposed new 
Ortho-Rater key reduced the number of mis- 
classified subjects with normal color vision from 
17 per cent to four per cent; the scores of the 
subjects with defective color vision remained 
unchanged. 


Received May 11, 1950. 
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A Study of Tachistoscopic Vision 


Newell C. Kephart and James D. Pecsok 
Occupational Research Center, Purdue University 


Although many studies have been made upon 
the span of apprehension as measured tachis- 
toscopically, and the effects of tachistoscopic 
training upon this apprehension, none could 
be found referring to the effects of tachis- 
toscopic exposure upon perception. Since 
many industrial jobs today, particularly those 
involving inspections, require split-second per- 
ception, it seems important that some studies 
be made in this field. The present paper does 
not attempt to discover the loss of perceptive 
power upon tachistoscopic exposure, but is 
merely the first step in an attempt to determine 
the correlation between a standard vision test 
(in this case the Ortho-Rater) and tachisto- 
scopic scores (at $ second exposure). 


The Experiment 


Equipment. The equipment used included, 
in addition to such items as chairs and a table, 
two white screens to be used for the back- 
ground; a set of checkerboard targets calibrated 
at 20/80, 20/67, 20/48, 20/40, 20/34, 20/28, 
20/24, 20/20, and 20/17; a half-silvered mirror; 
an electronic timer; and an Ortho-Rater. This 
equipment was set up as shown in Figure 1, 
with the half-silvered mirror making an angle 
of 45 degrees with the table edge, and the 
screens adjusted so that the distance from the 
subject’s eye to either screen was 20 feet. 
Light was furnished by Lumiline bulbs and 
was equated at 28 foot-candles on each screen. 
The timer was connected so that light would 
normally be on only Screen A, the only excep- 
tion being the interval after the time switch 
was thrown (# second here), at which time it 
would be on Screen B only. Through the 
half-silvered mirror, the subject could see only 
the screen which was lighted. The checker- 
board targets were pasted on 8 inch squares of 
white cardboard, with the target edge parallel 
to the edge of the cardboard. 

Subjects. The subjects were students in the 
beginning course of Psychology at Purdue Uni- 
versity. In most cases they were tested with 
either eye, so that 108 results were obtained 
from 46 males and 15 females. 

Method. All subjects were first given the 
complete battery of Ortho-Rater tests in the 
prescribed form, except that all monocular 


measurements were made with the other eye 
occluded. After the completion of the tests 
they were asked to sit so that their chin was 
on the chin rest, and the peep hole was adjusted 
so as to permit them to see the screen with one 
eye only. They were told to fixate with one 
eye upon an 8 inch square target on Screen A. 
This square was iden‘ical with those on which 
the targets had been pasted, and was so placed 
that a fixation point at its center would appear 
to be in the same place as the center of the 
checkerboard targets. 

The following instructions were then given: 
“I am going to flash on the screen targets 
exactly like the ones you just saw in the Ortho- 
Rater. The checkerboard will always be on 
the top, bottom, right, or left, never in the 
center. Please call out where you see the 
checkerboard.’”’ In order to give them an idea 
of what was to come, the largest, or 20/80, 
target was placed on Screen B and shown once. 
If the subject failed to understand what was 
expected, it was shown again. In almost every 
case there was no trouble, however, and the 
target was shown nine more times, in the 
order RR TRLBLTB. These showings 
were made as rapidly as the target could be 
manipulated, with the subject being given a 
warning of approximately one second before 
each showing. 

After a rest of thirty seconds, the second 
largest, or 20/67, target was shown ten times, 
also as rapidly as possible. This sequence of 
thirty seconds rest and then ten showings of 
the target was continued, using the targets in 
order of largest to smallest, until the subject 
failed to get at least five correct answers. 
(Five was arbitrarily decided to be the passing 
point after talks with several of the subjects 
who had gotten four correct responses had 
convinced the experimenter that they were 
guessing.) 

The order of presentation of the targets was 
as follows: 


Target 
Classification 
20/80 
20/67 
20/48 
20/40 
20/34 
20/28 
20/24 
20/20 


Order 
TRRTRLBLTB 
LBLTBTRRTR 
BTTRLRBTRL 
RBTRLBTTRL 
TRRTRLBLTB 
LBLTBTRRTR 
BTTRLRBTRL 
RBTRLBTTRL 


Target 


1 
2 
3 
4 
5 
6 
7 
8 
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The sequence for targets 1 and 3 was picked 
by cutting a deck of cards, that for targets 2 
and 4 is the same except that the starting point 
is presentation No. 5 instead of No. 1. Tar- 
gets 5, 6, 7, and 8 were presented in the same 
order as were 1, 2, 3, and 4. 

Scoring. Since, as previously mentioned, it 


was decided to make five correct responses the 
minimum for which any credit was to be given, 
it was necessary to derive a formula that would 
give four correct responses a score of zero. In 
order to accomplish this, the empirical formula 


= ROW) 10 was developed, where R 


is the number of correct responses and W is the 
number of incorrect responses (counting no 
response at all as being incorrect). The figure 
thus obtained was applied to the difference 
between the denominators of the target at 
which a miss occurred and the next larger 
target, and the result was subtracted from the 
larger number to give the score on the smaller 
target. As an example, suppose the following 
to be the scores of one subject: 


Target ‘20/80 20/67 20/48 20/40 20/34 20/28 
No.Rt. 10 10 9 7 5 4 


Through the 20/67 target the subject would get 
full credit, or a score of 67. For the 20/48 target 
(9 — (2/3)1) 10 oe 
wa. wen x 19 = 16. 
This number, when subtracted from 67, would 
give him a score of 51 through the 20/48 tar- 
get. For the 20/40 target his score would be 
(7 — (2/3)3) 10 = 
100 xX 8 = 4. 
from 51, gives him a score of 47 through the 
20/40 target. Similarly, for 20/34 his score is 
Sa es) xX 6 = 1, giving him a total 


his score would be 





This, subtracted 


score of 46. The 20/28 target is disregarded, 
as the credit is zero. 

Since, in the Snellen system, the difference 
between 20/50 and 20/55 is not the same as the 
difference between 20/30 and 20/35, it was 
decided to change these scores to percentage of 
visual efficiency by using the table on page 126 
of Tiffin (8). It must be remembered, how- 
ever, that these scores a: tive, and not 
absolute. That is, the iact that a subject 
emerged with a score of 70 does not mean that 
he is only 70% efficient under the conditions 
of the experiment 

After the efficiency scores had been obtained 
in this manner they were grouped by threes, 
and these grouped scores were used in all 
calculations. 


Results 


The correlation between the tachistoscopic 
scores and the Ortho-Rater scores for vision 
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with one eye at distance was calculated to be 
.68 + .05. The generally accepted figure for 
the reliability of the Ortho-Rater is .865. A 
test-retest reliability check gave .81 as the reli- 
ability of the tachistoscope test. Using .81 as 
the reliability of the tachistoscopic tests, and 
.865, as the reliability of the Ortho-Rater, the 
above correlation when corrected for attenua- 
tion is .81. The results in Figure 2 show the 
mean tachistoscopic score for individuals mak- 
ing each Ortho-Rater score. It can be seen 


MEAN VISUAL EFFICIENCY SCORES, % 
{TACHISTOSCOPE) 








8 9 10 " 12-UP 


ORTHO-RATER SCORES 


Tachistoscopic scores vs. Ortho-Rater acuity 
scores (one eye at distance). 








132 


that there is a rather marked relationship be- 
tween these two variables. 


Summary 


It seems safe to say that there is a definite 
relationship between visual acuity as measured 
on the Ortho-Rater and as measured tachisto- 
scopically in this experiment. The critical 
ratio of the Pearsonian r between the two is 
over 13, which is significant beyond all reason- 
able doubt. The actual value of the r seems 
still open to doubt, however, as the sample used 
in the present study was undoubtedly biased. 
Since the experiment was purely voluntary on 
the part of the subjects, and since they knew 
in advance that it was to be a vision experi- 
ment, it is probable that there was some pre- 
selection in that a number of those with poor 
vision simply did not enter into it. A still 
greater cause of doubt was necessitated by the 
limitations of the target used. The largest 
target available was calibrated at 20/80 and 
there is a segment of the population which is 
unable to see that well even under normal con- 
ditions, let alone under those which existed 
during the test. The results of subjects who 
were unable to see the first target were omitted 
from the calculations. Since the Ortho-Rater 
scores for these subjects were invariably low, 
such omissions would tend to give a spuriously 
low correlation figure. . 

In view of the relatively high r between 
tachistoscopic acuity as measured in this study 
and Ortho-Rater acuity scores obtained in the 
customary manner, it does not appear that any 
marked difference exists between the two types 


Newell C. Kephart and James D. Pecsok 


of acuity measurement. It is therefore felt 


that, for industrial purposes, adequate acuity 
tests administered in the customary manner 
will give satisfactory results in selection and 
placement of personnel and that the addition 
to a vision test battery of tachistoscopic meas- 
ures of acuity would be of questionable value. 


Received May 17, 1950. 
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An Experimental Consumer Panel Technique 


Edwin A. Fleishman 
The Ohio State University 


Experimental investigations of product pref- 
erences are usuilly confined to the laboratory 
situation with a restricted sample or to simple 
spot tests made in conjunction with consumer 
surveys. Previous experiments of these types 
have been made with shaving creams (11), 
breads (6), cigarettes (1, 4), and cola drinks 
(5, 7,8,9,10). Inanexperiment by the writer 
(3), preferences for a product were allowed to 
develop over a period of time. The study 
reported here was a further attempt to obtain 
a larger sample of behavior regarding consumer 
preferences than usually results from traditional 
techniques. Moreover, the experimental in- 
vestigation was conducted at the consumer 
level, using a panel of families selected from a 
cross-section of a large metropolitan city. The 
products investigated included six well-known 
brands of bottled beer. 


The Problem 


The purpose of the study was to observe the 
formation of beer preferences during a seven- 
day period, under conditions in which all means 
for identifying the beers except actual brand 
names were available to the subjects. 


The Method 


The procedure included: (1) the selection of 
an unbiased panel of twenty families; (2) sup- 
plying them with unlabeled bottles of beer for 
a seven day period; and (3) observing and 
recording their preferences during that time. 

The Consumer Panel. The panel of beer 
consumers included families living in different 
sections of the city. The same proportion of 
families from each of the four socio-economic 
groups was included in the sample as is found 
in the general beer-drinking population of that 
city.! Fifteen per cent of the families were 
Negro. In the sample, were beer drinkers 

1TIn a previous study (2) it was found that the “A” 
socio-economic group consumed 4.2% of the beer sold 
in this city, the “B” group consumed 7.3%, the “C” 


grou 38.1%, and the lowest group, “D,” consumed 
4%. The panel contained approximately these ratios 


whose expressed preferences included many dif- 
ferent brands. The number of families whose 
previous preferences included each of the brands 
in the study was equated. 

An important factor in selecting the families 
(as in all panel techniques) was their availa- 
bility and their willingness to cooperate. This 
had been determined during a previous survey 
by the writer involving 250 respondents. The 
families within the classifications mentioned 
above were then selected randomly. The po- 
tential subjects were reinterviewed to assure 
their cooperation and their understanding of 
the study. 

Procedure. The experiment lasted for seven 
days, during which time the families drank as 
many bottles of beer as they wanted. The 
study was conducted during the hot summer 
nionths when consumption was at a maximum. 


The experimental bottles of beer were pro- 
vided the families each morning. Each day 
each home received 48 bottles of beer in blank 
cases. Included were eight bottles of each of 
six different brands from which they could 
choose during the rest of the day. The bottles 
contained no labels or brand names. Instead, 
the tops on the bottles of each brand were 
painted with different colors. Thus, the tops 
of one brand might all be painted red, the tops 
of a second brand might all be green, and so on. 
All beers of the same brand would have tops of 
the same color on any one day. The subjects 
were told that the colors would be reassigned 
to the brands by random selection from day to 
day. Thus, a brand might or might not have 
the same color top from one day to the next. 
The subjects had to make a new set of choices 
each day as to what they “liked” or “didn’t 
like.” The subjects were not told which six 
brands were included in the experiment. All 
the bottles were of the same size, shape and 
color. The subjects could not distinguish be- 
tween the brands on the basis of beer color. 
During the study they drank no beer except 
that provided by the experimenter. 

The subjects kept records of the brands they 
drank (according to the top color) on the check 


of families within each socio-economic | og Thus, 
e 


of the 20 families selected, one was from “A” group, 
two from the “B” group, seven from the “C” group, 
and the remaining ten from the “D” group. 
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form provided each day. They also expressed 
their preferences for that day on the space pro- 
vided on the form. Each day the remaining 
bottles, bottle caps, and empties were collected, 
the forms checked against the bottles returned, 
and the home provided with the next day’s 
supply. 

The subjects were provided with more than 
they needed so that they could have an un- 
restricted choice of brands. On the weekend 
they were provided with twelve of each brand 
instead of the customary eight. During the 
course of the study frequent follow-up calls 
were made to insure that instructions were 
being followed and that there were no mis- 
understandings. The families were also pro- 
vided with a set of printed instructions. 


The study operated on the basic assumption 
that the people would drink more of what they 
liked and tend to avoid more of brands they 
disliked. Since the colors on the six brands 
were changed from day to day, a new set of 
decisions was made each day. The data offer 
two indices of preference. One includes the 
differences between the brands in the number 
of bottles of each consumed during the seven 
day period. The other is represented in the 
day by day tabulation of the brands (according 
to top color) named as preferred by the indi- 
viduals in the study. 


Results 


The results show that there were significant 
differences between brands in the number of 
bottles of each consiimed by the panel of con- 
sumers. Table 1 presents the total number of 
bottles of each brand consumed during the 
seven day period and the percentage of the 
total this represents for each brand. 


Table 1 


Absolute and Relative Frequency with Which Bottles 
of Each Brand of Beer Were Consumed 
During the Seven Day Period 





Number of 
Bottles 
Consumed 
625 18.4 
613 17.9 
591 17.4 
566 16.6 
514 15.1 
497 14.6 


3406 


Percentage 
f 


o! 
Total 
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Table 2 


Percentage of Times Each Brand Was Named as 
“Liked Best” During the Course 
of the Study 





Percentage 
“Liked Best” 


23.2 
21.4 
18.7 
17.1 
11.9 

7.7 








The chi square based on Table 1 was 16.7 
which is significant beyond the .01 level. In 
other words, there appeared to be real differ- 
ences in preference for beers even when the 
brand names were not known. A more com- 
plex analysis, which involved successive elimi- 
nation of different brands and combinations of 
brands from the analysis revealed that the 
majority of the variation was a result of the 
avoidance of Brands Eand F. The differences 
between the number of bottles of Brands A, 
B, and C consumed were not significant. 
Brand D was not preferred significantly more 
than the last two brands and was not avoided 
significantly less than the leading three brands 
in the study. 

In addition to drinking most of what they 
liked, all the subjects were requested to write 
in on the daily record sheet they received, the 
color of the top on the brand they liked best 
thatday. The number preferring each “color” 
was tabulated under the appropriate brand 
each day. Table 2 summarizes the proportion 
of times each brand was named as preferred 
over the remaining five during the seven days. 
In all, 327 choices were made. 

The relationship between expressed prefer- 
ence and the amount of each brand actually 
consumed is shown to be high. 

An analysis of the day to day variation in the 
number of bottles of each brand consumed daily 
yielded a chi square which was not significant 
(chi square = 22.4, d.f. = 25, P>.05). Thus, 
although there were significant differences 
between brands in the total number of bottles 
of each consumed, there was little shift from 
day to day in the brands preferred and avoided. 
The trend was generally maintained through 
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the week. The two avoided brands were con- 
sistently avoided, and the brand high each day 
generally varied among the three preferred 
brands. 

An analysis of differences in preference be- 
tween the families in the panel, however, 
showed wide inter-family differences in the 
number of bottles of each brand consumed. 
Thus, every brand was preferred by some fam- 
ily during the study. However, more of the 
families formed preferences for Brands A, B, 
and C, than for the other brands. 

It was also found that preferences formed 
were for the most part unstable and most sub- 
jects did not actually pick out the same brands 
from day to day. In other words, the indi- 
viduals contributing to the total number of 
bottles consumed of a particular brand during 
the week, differed from day to day. Thus, 
although more of the subjects liked Brands A, 
B, or C, more of the time and avoided Brands 
E and F more of the time, individual prefer- 
ences generally tended to shift from day to day. 

The relationship between previous brand 
preferences and preferences formed during the 
experiment was also investigated. The per- 
centage of times Brand A drinkers said they 
liked Brand A during the study, the percentage 
of Brand B drinkers who named Brand B (by 
colored top, of course), and so on, was as fol- 
lows: Brand A drinkers, 56.3 per cent; Brand D 
drinkers, 45.2 per cent; Brand C drinker: 28.9 
per cent; Brand B drinkers, 26.3 per cent; and 
Brand F drinkers, 12.5 per cent. Thus, users 
of Brands A and D seemed more apt to form 
preferences for their old brands when the brand 
names were not known than did users of other 
brands. The percentage of times drinkers of 
Brands B, C, and F picked their own brands 
are within the limits of chance expectancy. 


Summary 


A panel of consumers was selected from a 
large city and their preferences for brands of 
beer investigated through the use of an experi- 
mental technique. 

1. The study showed that there were real 
differences in preferences for beers even when 
the brand names were not known. 

2. Of the beers investigated, three were gen- 
erally preferred, one occupied a position just 
below the leaders, and two were more generally 
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avoided. This was on the basis of the total 
number of bottles consumed during the study. 

3. The preferences expressed by the subjects 
during the study were in agreement with the 
amount actually consumed. 

4. The general trend in preference for the 
families in the study generally prevailed for the 
group from day to day. 

5. Preferences expressed by the individuals 
in the study generally proved to be unstable 
and shifted from one brand to the next. How- 
ever, more people preferred three of the brands 
and avoided two others more of the time. 

6. However, previous users of two of the 
brands seemed better able to form preferences 
for their old brands when the brand names 
were not known than users of the other brands. 

7. The study indicates that important infor- 
mation may be gained using experimental tech- 
niques at the consumer level. The fruitfulness 


of this approach to problems of market research 
should increase with more refined techniques 
and research projects of longer duration. 


Received May 6, 1950. 
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Book Reviews 


Hattwick, M. S. How to use psychology for 
better advertising. New York: Prentice- 
Hall, Inc., 1950. Pp. xvi + 376. $5.75. 
In a stimulating and entertaining book Dr. 

Hattwick shows that the application of the 
findings of psychology can lead to the successful 
practice of advertising. Although the book is 
written in popular, almost telegraphic style, 
the author includes countless experimental 
studies and carefully chosen illustrations to 
indicate the fruitful utilization of scientific 
methods in a field which is still fluid in terms 
of methodology as well as findings. 

The book is divided into four major parts 
and twenty-one chapters. Part 1 (2 chapters) 
is entitled “What Do Consumers Want?” and 
concerns itself with a discussion of human needs 
and means of their satisfaction. Part 2 (14 
chapters) iscalled “Six Steps That Make Adver- 
tising More Effective,” and deals with prob- 


lems of consumer research, the make-up of the 
market, finding the right appeal, attention- 
getting devices, keeping the consumer’s inter- 
esteand impressing his memory. 


Part 3 (3 
chapters) discusses “Methods of Evaluating 
Advertising Effectiveness,” and Part 4 (2 chap- 
ters) concludes the book with a discussion on 
“Building Customer Cenfidence in Advertis- 
ing.”” Concise summaries and selected refer- 
ences are included at the end of each chapter. 

The pleasant, rambling organization, per- 
haps one of the greatest assets of the book, also 
offers drawbacks, especially from the view-point 
of the teaching psychologist. There might be 
more theoretical discussions of motivation, 
emotion, attention, perception or learning, 
although pertinent material from all of these 
areas can be found throughout the book. There 
is no adequate treatment of the assumptions of 
the scientific method, although this type of 
discussion would certainly have a place in a 
book designed to sell science to the practi- 
tioner. There are some inaccuracies in psy- 
chological theory, such as a listing of a specific 
number of basic human wants, or a statement 
that “practically all persons ... have the 
same fundamental drives (wants) that are 


usually satisfied in about the same ways” (p. 
24). 
All in all Dr. Hattwick’s book should be 
highly appreciated by the practical advertiser 
who is not familiar with the contributions of 
psychology to the knowledge in his field; the 
teacher of psychology will find it very useful 
as an up-to-date basic text, provided he pre- 
sents some of the theoretical material from 
other sources. 
Irving R. Weschler 


University of California, Los Angeles 


Harrel, Thomas Willard. Industrial psychol- 
ogy. New York: Rinehart and Company, 
Inc., 1949. Pp. xvii + 462. $4.25. 

The objectives of this book are clearly indi- 
cated in the author’s preface when he states 
that today’s industrial psychology has as its 
primary function the analysis of human rela- 
tions in industry rather than the use of tests. 
One has a feeling while reading the first 
third of the text, however, that the author has 
not been entirely willing to give himself over to 
this ‘‘new” approach, for he discusses at some 
length the kinds of tests and their uses. Un- 
fortunately the ‘‘uses” sections are often too 
brief to be really instructive. In several in- 
stances the author seems hesitant about recom- 
mending validated methods and says “this 
could be used”; “this might be used.” The 
reviewer feels that users of industrial psychol- 
ogy’s methods are ready for more positive sug- 
gestions. 

One section of the text is titled “Human En- 
gineering.” Perhaps this would be a proper 
time to voice an objection to the use of the 
term “human engineering” when one speaks of 
materials, equipment, and environmental fac- 
tors. Engineering procedures are reflected in 
human behavior and are appraised in terms of 
human response. The use of the term by com- 
mercial vocational testing groups also adds to 
the confusion. This leads the reviewer to sug- 
gest that industrial psychologists might profit 
by deciding to use a less confusing term. 

Most of this section is concerned with brief 
descriptions of those areas which constitute 
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special subject matter fields in advanced psy- 
chology and industrial management. The 
absence of material on equipment design is 
particularly apparent. 

When the author returns to his basic objec- 
tive of dealing with the analysis of human rela- 
tions in industry the text makes a definite gain 
in interest value and practical application of 
the psychologist’s knowledge. The excellent 
bibliography on attitudes and job satisfaction 
cannot, however, overcome the over-detailed 
enumeration of studies and the small emphasis 
on evaluation of the chapter on that subject. 
There are many reviews of the kinds of surveys 
and where they were made but little or no 
reference is made to the problems solved or 
the issues met by these surveys. The data of 
the survey do not comprise the end to be 
sought, but are only a means for reaching a 
solution to an industrial problem. 

The illustrations used in the discussion of 
morale and monotony are sometimes hard to 
follow, which suggests hurried writing and not 
too careful editing. This comment also ap- 
plies to some of the tables which are full page 
and are not clearly delineated as tables. They 
appear almost as text material. 

In the chapter on psychological aspects of 
labor relations the author fulfills his objective 
in a most adequate fashion and gives other text- 
book writers in industrial psychology cause for 
revising their presentations to make the mate- 
rial more applicable to the real analysis of 
human relations in industry. It turns atten- 
tion from the tools of the psychologist to the 
service he has to offer in the field of industrial 
problems. This change in perspective is in 
itself a strong recommendation for the book. 
No teacher or consultant in industrial psychol- 
ogy can afford to overlook this important point 
of view. 

Eleroy L. Stromberg 


Western Reserve University 


Deming, W. E. Some theory of sampling. New 
York: Wiley, 1950. Pp. xvii + 602. $9.00. 
It is not a criticism of this work to say that 

only a few psychologists will find it useful, 

since the volunie is mainly concerned with the 
theory of large-scale sampling of the survey 
type. The chief emphasis is on distribution 
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theory and the determination of appropriate 
sampling error formulas that go with various 
sampling schemes. Those whose calculus is 
rusty will find the going tough at times, espe- 
cially in the chapter devoted to the gamma and 
the beta functions. 

The book is not entirely mathematical in 
character; dispersed throughout one finds many 
practical suggestions and many examples illus- 
trating the use of various sampling methods. 
The excellent nontechnical discussion in the 
first two chapters will repay careful reading by 
anyone who is interested in the difficulties that 
beset sampling surveys. 

Quinn McNemar 

Stanford University 


Noland, E. William, and Bakke, E. Wight. 
Workers wanted. New York: Harper and 
Brothers, 1949. Pp. xi + 233. $3.00. 

This study of employers’ hiring policies, pref- 
erences, and practices in New Haven, Connecti- 
cut and Charlotte, North Carolina is the third 
Interim Report issued under the auspices of the 
Yale Labor and Management Center. 

The study is based on guided interviews held 
in 1945 and 1946 with the persons responsible 
for setting hiring policy and directing hiring 
practice in 135 companies in New Haven and 
105 companies in Charlotte. These companies 
employed approximately eighty per cent of the 
industrial labor force and represented all major 
industries (businesses excluded) in the two 
cities. Interview data were reduced to 99 
tables, and the book is essentially a discussion © 
of these tables. 

The first two chapters give an overview and © 
a discussion of the order of importance (based 7 
on requirements and preferences) assigned by ~ 
employers to seventeen employee characteris- ~ 
tics such as character, personality traits, expe- 
rience, family status, religion, etc. Each of the 
next five chapters deals with a particular occu- 
pational group: production workers, common 
laborers, service and maintenance workers, rou- 
tine clerical workers, and executive and admin- 
istrative assistants. The next two chapters 
deal with methods used by employers to deter- 
mine the extent to which applicants meet re- 
quirements and preferences, and sources of 
potential employees. The last chapter devel- 
ops a picture of motivations, compulsions, and 
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organizational requirements of management, 
and suggests implications regarding the em- 
ployer, community, and worker. Appendices 
give a note on methodology and 58 of the tables 
which form the background of the book. 

The data gathering, analysis, and report indi- 
cate the close attention to minute detail which 
is characteristic of sound research. The auth- 
ors appear to have extracted the last gram of 
value from their data, whereas many others 
would have been content with a more super- 
ficial and less productive analysis. The book 
abounds with penetrating insights, interpreta- 
tions, and speculations. 

The book is a popularized interim report, 
makes reference to only three other publica- 
tions, contains no bibliography, and contains 
no index. The result is a highly readable report 
of value to personnel psychologists, sociologists, 
economists, industrial executives, union lead- 
ers, and others interested in employment. 

C. E. Jurgensen 


Minneapolis Gas Company 


Jennings, Helen Hall. 
relations. 


Sociometry in group 
Washington, D. C.: American 


Council on Education, 1948. 2nd printing 
1949. Pp. ix + 85. 


$1.25. 


The quality and effectiveness of human asso- 
ciation, both for personal and social ends, has 
become the subject of scientific scrutiny in 
many settings. To what extent can the micro- 
cosm of the classroom yield answers and orig- 
inate useful methods for the improvement of 
social relations? 

Jennings and her associates in the project in 
Intergroup Education in Cooperating Schools 
has supplied a guide for teachers in the use of 
sociometric methods. Techniques for select- 
ing situations, framing questions, tabulating 
choices, and constructing sociograms are de- 
scribed in detail. However, the pamphlet is 
more than a manual, for leads for the analysis 
of the process back of the group structure are 
supplied with much insight based on previous 
experiment and experience. Suggestions for 
interviews, for individual treatment, for struc- 
tural changes, and for the appraisal of growth 
through successive sociograms are given. Il- 
lustrations are presented of problems arising 
from social, race, and religious differences and 
of promising ways for improvement. 


Book Reviews 


While the report is based on classroom set- 
tings the range of implications and applications 
is over the entire field of interpersonal relation- 
ships. 


Willard C. Olson 
University of Michigan 


Hamrin, Shirley A., and Paulson, Blanche B. 
Counseling adolescents. Chicago: Science 
Research Associates, Inc., 1950. Pp. 380. 
$3.50. 

This volume, the first book to be published 
in SRA’s new Professional Guidance Service, 
will probably find itself most appreciated in 
guidance and education classes in college. It 
presents the counseling process in a simple, 
unsophisticated manner without any of the 
esoteric terminology or circumlocution which 
so often abounds in this type of literature. In 
presenting techniques utilized in counseling, 
the authors claim to be eclectic and state their 
views in non-technical and easily comprehen- 
sible language. They have given us an ele- 
mentary introduction to the field and have per- 
sonalized a good portion of the content, the 
words of wisdom being put into the mouths of 
two paragons of psychological virtue. The 
authors employ a common sense approach to 
the needs of adolescents and to their various 
pitfalls on the road to maturity and in so doing, 
they introduce wholesome ideas and attitudes 
in their own natural and unaffected way. Their 
treatment of psychological needs and mecha- 
nisms is an unpretentious one with emphasis 
being laid on bringing about an integrated 
adjustment in both the individual and environ- 
ment rather than on altering one to suit the 
other. Each point is well demonstrated by 
the two fictionalized counselors who reappear 
through most of the chapters. 

The authors consider counseling as a special- 
ized function of school preparation and feel 
that counseling techniques should be specifi- 
cally designed for the different age groups. 
Counselors, it is felt, should understand general 
psychological concepts, motives, mechanisms, 
modes of adjustment and specific problems of 
the adolescent group. Furthermore, the coun- 
selor should understand and be able to handle 
skillfully the counseling process in its various 
phases. The content of the book deals consec- 
utively with the concept of behavior, the mat- 
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uration process, philosophies of and attitudes 
toward counseling, types of problems in the 
adolescent group and necessary traits for the 
competent counselor. 

The greatest value of this book may be found 
in its departure from a rigid or stereotyped 
counseling philosophy and in its freer and more 
individualized method of dealing with adoles- 
cent problems. It should be valuable for under- 
graduate college classes because of its sound 
and understandable advice; but its content 
appears to be too elementary for it to be of 
any real help to the experienced counselor or 
teacher. It covers what a counselor needs to 
know and how he should use this knowledge, 
but it is not provocative, raises no new issues, 
and makes no distinctive contribution to the 
guidance field. ; 


Jewel B. Kaufman 
University of Michigan 


Britt, Steuart Henderson. Social psychology 
of modern life. (Rev. Ed.) New York: 
Rinehart & Co., Inc., 1949. Pp. xvi + 703. 
$4.50. 


This revision parallels the organization and 
content of the first edition and continues to be 
quite readable and interesting. Its many illus- 
trations, suggested readings, discussion ques- 
tions commend it for teaching purposes. In 
certain areas, recent literature has been thor- 
oughly culled, increasing the length by over a 
hundred pages. 

The very extensiveness of the book is one of 
its shortcomings; touching on so many topics, 
Britt has not gone deeply into any. The criti- 
cism of the first edition—superficiality—still 
applies. And for this, he has supplied his own 
best criticism: “After all, he (the student) says, 
if what we are learning now (facts which may 
change) may not be true, why should we bother 
to learn it?” (p. 419). There seem to be 
too many “facts” presented without enough 
attention to the methods for obtaining them. 

Without claiming to be a systematist, Britt 
does use certain organizing concepts, such as 
learning principles and Thomas’ four wishes. 
But, instead of lending meaning to the data, 
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they give a hobby-horse effect: movement with- 
out progress. 

It is also difficult to tell, despite the first 
chapter, whether this is a text in general psy- 
chology, social psychology, or sociology. The 
inclusion of so much material on learning, the 
nervous system, etc., seems unnecessary. The 
groups with which he deals, such as the church, 
the gang, and the nation, are sociological classi- 
fications. Britt has not made the transfer to 
the psychologist’s interest in the effects on or 
the origins of these in the individual. 

The few actual errors of commission are not 
serious but should not be in a revision. For 
example, he cites without comment the conclu- 
sion of a study in which, because of a very high 
correlation, it is said that causality may be con- 
sidered ; he calls a series of symmetrical curves 
normal, though they differ in kurtosis; his prin- 
cipal criticism of the Thurstone technique deals 
with its difficulty, which is over-exaggerated. 
While not an error, to refer to E. L. Thorndike 
as ‘“‘a world famous psychologist” without nam- 
ing him seems strange in a text book. 

The most serious difficulty, to this reviewer, 
lies in what has been omitted. Thomsen, re- 


viewing the first edition, found it amazing that 
such industrial work as that of Roethlisberger 
and Dickson should have been left out; in the 
present edition, two short paragraphs seem to 


suffice. Industrial work is sorely lacking, an 
unusual situation considering Britt’s specialty. 
To a considerable extent, this is still a study of 
the social behavior of college sophomores. The 
uses of propaganda in the hot and cold wars 
have not affected the treatment of that topic; 
the major contribution of psychology during 
the war seems to have been the “Psychologists’ 
Manifesto’’; no real changes have been made in 
the treatment of the totalitarian state; no sig- 
nificant material on men at war has been added. 

The student using this book will find it inter- 
esting; he will learn many facts about certain 
aspects of current social life. How much he 
will learn about the bases for social living, the 
psychological origins of groups, why men behave 
as they do in their working worlds remains a 
serious question. 


John W. Gustad 
Vanderbilt University 
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